(57) Abstract

The nucleotide sequence for the tomato spotted wilt virus (TSWV) nucleocapsid is described, and transgenic plants containing the
nucleocapsid nucleotide sequence from a TSWV isolate are shown to be resistant to Tospoviruses from different
serogroups. In addition, transgenic plants containing the nucleocapsid nucleotide sequence from a lettuce isolate of TSWV were produced.
Plants producing small amounts of the nucleocapsid protein were resistant to both the homologous
isolate and closely related viral isolates, whereas plants producing larger amounts of the nucleocapsid protein possessed moderate levels of protection
against both the homologous isolate and isolates of the distantly related Impatiens necrotic spot virus (INSV).
TOMATO SPOTTED WILT VIRUS

Viruses in the Tospovirus genus infect a wide variety of plant
species, particularly tobacco, peanut, vegetables and ornamental plants.
Two virus species, tomato spotted wilt virus (TSWV) and impatiens
necrotic spot virus (INSV), are recognized within the Tospovirus genus.

Tomato spotted wilt virus (TSWV) is unique among plant viruses
in that the nucleic acid-protein complex is covered by a lipoprotein
envelope and it is the only thrips-transmitted virus. This virus has
recently been classified as the Tospovirus genus of the Bunyaviridae
family. TSWV virions contain a 29K nucleocapsid protein ("NP" or "N"),
two membrane-associated glycoproteins (58K and 78K) and a large
200K protein presumably for the viral transcriptase [see J. Gen. Virol.
71:2207 (1991); Virol. 56:12 (1973); and J. Gen. Virol. 36:267 (1977)].

The virus genome consists of three negative-strand (-) RNAs designated
L RNA (8900 nucleotides), M RNA (5400 nucleotides) and S RNA (2900
nucleotides) [see J. Gen. Virol. 36:81 (1977); J. Gen. Virol. 53:12 (1981);
and J. Gen. Virol. 70:3469 (1989)], each of which is encapsulated by the
NP. The partial or full-length sequences of S RNAs from three TSWV
isolates reveal the presence of two open reading frames (ORFs) with an
ambisense gene arrangement [see J. Gen. Virol. 71:1 (1990) and J. Gen.
Virol. 72:461 (1991)]. The larger open reading frame is located on the
viral RNA strand and has the capacity to encode a 52K nonstructural
protein. The smaller ORF is located on the viral complementary RNA
strand and is translated through a subgenomic RNA into the 29K NP.

The ambisense coding strategy is also characteristic of the
TSWV M RNA, with the open reading frames encoding the 58K and 78K
membrane-associated glycoproteins. The TSWV L RNA has been
sequenced and encodes a large 200K protein presumably for the viral
transcriptase.

Two TSWV serogroups, "L" and "I", have been identified and
characterized based on serological analysis of the structural proteins
and morphology of cytopathic structures [see J. Gen. Virol. 71:933
(1990) and Phytopathology 81:525 (1991)]. They have serologically
conserved G1 and G2 glycoproteins, but the NP of the "I" serogroup is
serologically distinct from that of the "L" serogroup. Comparison of the
NP between the "L" and "I" serogroups has shown 62% and 67% identities
at the nucleotide and amino acid levels, respectively [see J. Gen. Virol.
72:2597 (1991)].

TSWV has a wide host range, infecting more than 360 plant
species of 50 families, and causes significant economic losses to
vegetables and ornamental plants worldwide. The "L" serogroup has
been found extensively in field crops such as vegetables and weeds,
while the "I" serogroup has been largely confined to ornamental crops.
A cucurbit isolate has recently been identified [see Plant Disease
68:1006 (1984)] as a distinct isolate because it systemically infects
watermelon and other cucurbits and its NP is serologically unrelated
to that of either serogroup. Although the spread of the TSWV disease
can sometimes be reduced by breeding resistant plants or using
non-genetic approaches, complete control of the disease by these
conventional methods has generally proven to be difficult [see Plant
Disease 73:375 (1989)].

Since 1986, numerous reports have shown that transgenic plants
with the coat protein (CP) gene of a virus are often resistant to
infection by that virus. This phenomenon is commonly referred to as
coat protein-mediated protection (CPMP). The degree of protection
ranges from a delay in symptom expression to the absence of disease
symptoms and virus accumulation. Two recent independent reports
[see Bio/Technology 9:1363 (1991) and Mol. Plant-Microbe Interact.
5:34 (1992)] showed that transgenic tobacco plants expressing the
nucleocapsid protein (NP) gene of TSWV are resistant to infection by
the homologous isolate. However, since TSWV is widespread with many
biologically diverse isolates, it is very important to test the
ability of the transgenic plants to resist infection by different
TSWV isolates. The findings of the present invention expand on those of
the previous reports by demonstrating that transgenic plants according
to the present invention showed resistance to two heterologous
isolates of the "L" serogroup and an isolate of the "I" serogroup. We
also show that resistance to the two heterologous isolates of the "L"
serogroup was mainly found in plants accumulating very low, if any,
levels of NP, while transgenic plants that accumulated high levels of NP
were resistant to the isolate of the "I" serogroup.

However, no resistance was observed to a Brazilian isolate,
although the plants that accumulated high levels of the N protein did
display a delay in symptom expression. This Brazilian isolate,
designated TSWV-B, has an N protein that is serologically distinct
from those of the "L" and "I" serogroups, and it differs biologically from a cucurbit
isolate in that TSWV-B does not systemically infect melons or
squash. Therefore, one aspect of the present invention is to
characterize TSWV-B by cloning and sequencing its S RNA and
comparing it with the published sequences of other TSWV isolates.

Various aspects of the present invention will become readily
apparent from the detailed description of the present invention
including the following examples, figures and data.

In the Figures:

Fig. 1 depicts the strategy for cloning the NP gene from viral RNA
according to the present invention;

Fig. 2 depicts the in vivo transient expression of the nucleocapsid
protein (NP) gene of tomato spotted wilt virus according to the present
invention in tobacco protoplasts;

Fig. 3 depicts the location of the sequenced cDNA clones in the
TSWV-B S RNA according to the present invention;

Fig. 4 depicts a dendrogram showing relationships among TSWV
isolates according to the present invention;

Fig. 5 depicts the serological relationship of TSWV isolates
described herein;

Fig. 6 depicts the correlation of the level of nucleocapsid protein
(NP) accumulation in transgenic plants with the degree of resistance to
TSWV isolates;

Fig. 7 depicts the TSWV-BL N coding sequences introduced into
transgenic plants in accordance with one aspect of the present
invention; and

Fig. 8 depicts the TSWV-BL half N gene fragments introduced into
plants in accordance with one aspect of the present invention.
More specifically, figure 2 depicts transient expression of the NP
gene in which the constructs were transferred into tobacco mesophyll
protoplasts using polyethylene glycol (PEG). The transformed
protoplasts were subsequently incubated for two days for the
expression of the NP gene. Proteins were extracted from the
protoplasts and tested for the NP by double antibody sandwich
enzyme-linked immunosorbent assay (DAS-ELISA) using antibodies against the
TSWV NP. NP- and NP+ represent the protoplasts transformed with
plasmids pBI525-NP- and pBI525-NP+, respectively. Concentration of
the antibodies for coating: 5 µg/ml; dilution of the enzyme conjugate:
1:250. Data were taken 30, 60 and 90 min. after addition of substrate.

In figure 3, the five overlapping cDNA clones are shown to scale
below an S RNA map of TSWV-B. These clones were synthesized with
random primers from double-stranded RNA isolated from N. benthamiana
plants infected with TSWV-B.

In figure 4, the sequences were compared using the pileup
program of the GCG sequence analysis software package. Horizontal
lines are proportional to the genetic distance, while vertical lines are
of arbitrary length and have no significance.

More specifically, in figure 5, N. benthamiana Domin. plants were
infected with TSWV isolates [TSWV-BL (a lettuce isolate), Arkansas,
10W pakchoy (TSWV-10W), Begonia, and Brazil (TSWV-B)]. An infected
leaf disc (0.05 gram) was ground in 12 ml of the enzyme conjugate
buffer and analyzed by DAS-ELISA using antibodies raised against
TSWV-BL virion (BL virion), the NP of TSWV-BL (BL-NP), or the NP of
TSWV-I (I-NP). The concentration of antibodies for coating was 1 µg/ml; dilutions of
the conjugates were 1:2000 for BL virion, 1:250 for BL-NP, and 1:1000 for
I-NP. The results were taken 10 minutes (BL virion), 50 minutes (BL-NP),
or 30 minutes (I-NP) after adding substrate.

With regard to figure 6, transgenic plants were assayed in
DAS-ELISA for NP accumulation with antibodies raised against the NP of
TSWV-BL. Plants were read 150 min. after adding substrate and the
transgenic plants were grouped into four categories: OD405nm smaller
than 0.050, OD405nm between 0.050 and 0.200, OD405nm between 0.200
and 0.400, and OD405nm greater than 0.400. The OD405nm readings of
control NP(-) plants were from zero to 0.05. The same plants were
challenged with either the Arkansas (Ark) and 10W pakchoy (10W)
isolates or the Begonia isolate, and the susceptibility of each plant was
recorded about 12 days after inoculation. The results were pooled from
fifty-one R1 NP(+) plants inoculated with the Arkansas and 10W
pakchoy isolates and one hundred thirty-nine R1 NP(+) plants inoculated
with the Begonia isolate. Numbers above bars represent total numbers
of R1 NP(+) plants tested.
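
The grouping of plants by OD405nm reading described for figure 6 can be sketched as a small binning routine. This is only an illustration: the function names and the sample readings are hypothetical, and the handling of a reading that falls exactly on a class boundary is an assumption, since the description does not say which class such a reading belongs to.

```python
from collections import Counter

# Upper bounds of the first three classes, taken from the figure 6 description.
BOUNDS = (0.050, 0.200, 0.400)
LABELS = ("<0.050", "0.050-0.200", "0.200-0.400", ">0.400")


def np_class(od405):
    """Return the NP accumulation class for one OD405nm reading."""
    # Assumption: a reading equal to a bound is placed in the higher class.
    for bound, label in zip(BOUNDS, LABELS):
        if od405 < bound:
            return label
    return LABELS[-1]


def tally(readings):
    """Count how many plants fall into each class."""
    return Counter(np_class(r) for r in readings)


# Made-up readings, for illustration only (not data from the figure).
readings = [0.012, 0.048, 0.050, 0.130, 0.310, 0.520]
counts = tally(readings)
```

Pairing each class with the number of resistant plants in it would then give the kind of per-class comparison that figure 6 summarizes.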

EXAMPLE I 
Isolation of TSWV-BL RNAs:

The TSWV-BL isolate was purified from Datura stramonium L. as
follows: the infected tissues were ground in a Waring Blender for 45
sec with three volumes of a buffer (0.033 M KH2PO4, 0.067 M K2HPO4,
0.01 M Na2SO3). The homogenate was filtered through 4 layers of
cheesecloth moistened with the above buffer and centrifuged at 7,000
rpm for 15 min. The pellet was resuspended in an amount of 0.01 M
Na2SO3 equal to the original weight of tissue and centrifuged again at
8,000 rpm for 15 min. After the supernatant was resuspended in an
amount of 0.01 M Na2SO3 equal to 1/10 of the original tissue weight,
the virus extract was centrifuged at 9,000 rpm for 15 min. and the
supernatant was carefully loaded on a 10-40% sucrose step gradient
made up in 0.01 M Na2SO3. After centrifugation at 23,000 rpm for 35
min., the virus zone (about 3 cm below meniscus) was collected and
diluted with two volumes of 0.01 M Na2SO3. The semi-purified virus
was pelleted at 27,000 rpm for 55 min.
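
As a worked illustration of the extraction buffer given in this example, the sketch below converts the stated molar concentrations into grams of salt for a chosen volume. The molar masses are those of the anhydrous salts, which is an assumption because the text does not say which form was used. Reading "three volumes" as 3 ml of buffer per gram of tissue is also an assumption, and the function names are hypothetical.

```python
# Molar masses in g/mol for the anhydrous salts (assumption: the text does
# not state whether anhydrous or hydrated salts were used).
MOLAR_MASS = {"KH2PO4": 136.09, "K2HPO4": 174.18, "Na2SO3": 126.04}

# Molar concentrations of the extraction buffer as given in Example I.
EXTRACTION_BUFFER = {"KH2PO4": 0.033, "K2HPO4": 0.067, "Na2SO3": 0.01}


def grams_needed(recipe, volume_l):
    """Grams of each salt required to prepare volume_l litres of buffer."""
    return {
        salt: round(molar * MOLAR_MASS[salt] * volume_l, 2)
        for salt, molar in recipe.items()
    }


def buffer_volume_ml(tissue_g, volumes=3):
    """Buffer volume, reading 'three volumes' as 3 ml per gram of tissue."""
    return tissue_g * volumes


per_litre = grams_needed(EXTRACTION_BUFFER, 1.0)
```

Under these assumptions, one litre of buffer calls for roughly 4.49 g KH2PO4, 11.67 g K2HPO4 and 1.26 g Na2SO3.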

EXAMPLE II 

Purification of TSWV and viral RNAs:

The TSWV-BL isolate [see Plant Disease 74:154 (1990)] was
purified from Datura stramonium L. as described in Example I. The
purified virus was resuspended in a solution of 0.04% of bentonite, 10
ng/ml of proteinase K, 0.1 M ammonium carbonate, 0.1% (w/v) of
sodium diethyldithiocarbamate, 1 mM EDTA, and 1% (w/v) of sodium
dodecyl sulfate (SDS), incubated at 65°C for 5 min., and immediately
extracted with H2O-saturated phenol, followed by another extraction
with chloroform/isoamyl alcohol (24:1). Viral RNAs were precipitated
in 2.5 volumes of ethanol and dissolved in distilled H2O.

EXAMPLE III 

cDNA and PCR-based NP gene cloning:

The first strand cDNA was synthesized from purified TSWV-BL
RNAs using random primers as described by Gubler and Hoffman [see
Gene 25:263 (1983)]. The second strand was produced by treatment of
the sample with RNase H/DNA polymerase. The resulting
double-stranded cDNA sample was size-fractionated by sucrose gradient
centrifugation and methylated by EcoRI methylase, and EcoRI linkers were
added. After digestion with EcoRI, the cDNA sample was ligated into
the EcoRI site of pUC18, whose 5'-terminal phosphate groups had been
removed by treatment with calf intestinal alkaline phosphatase. E. coli
DH5α competent cells (Bethesda Research Laboratories) were
transformed and clones containing TSWV cDNA inserts were first
selected by plating on agar plates containing 50 µg/ml of ampicillin,
IPTG, and X-gal. Plasmid DNAs from selected clones were isolated
using an alkaline lysis procedure [see BRL Focus 11:7 (1989)], and the
insert sizes were determined by EcoRI restriction enzyme digestion
followed by DNA transfer onto GeneScreen Plus nylon filters (DuPont).
Plasmid clones that contained a TSWV-BL S RNA cDNA insert were
identified as described below by hybridizing against a 32P-labelled
oligomer (AGCAGGCAAAACTCGCAGAACTTGC) complementary to the
nucleotide sequence (GCAAGTTCTGCGAGTTTTGCCTGCT) of the
TSWV-CPNH1 S RNA [see J. Gen. Virol. 71:001 (1990)]. Several clones were
identified and analyzed on agarose gels to determine the insert sizes.
The clone pTSWVS-23 was found to contain the largest cDNA insert,
about 1.7 kb in length.
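
The statement that the labelled oligomer is complementary to the quoted TSWV-CPNH1 sequence can be checked with a short reverse-complement routine. The sketch below is a minimal illustration using the two sequences exactly as printed above; the function and variable names are hypothetical.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


# Sequences as printed in the text.
PROBE = "AGCAGGCAAAACTCGCAGAACTTGC"
TARGET = "GCAAGTTCTGCGAGTTTTGCCTGCT"

probe_matches_target = reverse_complement(PROBE) == TARGET
```

Both strings are 25 nucleotides long, and the probe is the exact reverse complement of the target, which is what hybridization to that region requires.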

The full-length NP gene was obtained by the use of polymerase
chain reaction (PCR). First-strand cDNA synthesis was carried out at
37°C for 30 min. in a 20 µl reaction mixture using oligomer primer
JLS90-46 (5'->3') AGCTAACCATGGTTAAGCTCACTAAGGAAAGC (also
used to synthesize the nucleocapsid gene of TSWV-10W), which is
complementary to the S RNA in the 5' terminus of the TSWV NP gene
(nucleotide positions 2751 to 2773 of the TSWV-CPNH1). The reaction
mixture contained 1.5 µg of viral RNAs, 1 ng of the oligomer primer, 0.2
mM of each dNTP, 1X PCR buffer (the GeneAmp kit, Perkin-Elmer-Cetus),
20 U of RNasin ribonuclease inhibitor (Promega), 2.5 mM of MgCl2, and
25 U of AMV reverse transcriptase (Promega Corporation). The reaction
was terminated by heating at 95°C for 5 min. and cooled on ice. Then 10
µl of the cDNA/RNA hybrid was used to PCR-amplify the NP gene
according to the manufacturer's instructions (Perkin-Elmer-Cetus) using 1
µg each of oligomer primers JLS90-46 and JLS90-47 (5'->3')
AGCATTCCATGGTTAACACACTAAGCAAGCAC (also used to synthesize the
nucleocapsid gene of TSWV-10W), the latter oligomer being identical to
the S RNA in the 3' noncoding region of the gene (nucleotide positions
1919 to 1938 of the TSWV-CPNH1). A typical PCR cycle was 1 min. at
92°C (denaturing), 1 min. at 50°C (annealing), and 2 min. at 72°C
(polymerizing). The sample was directly loaded and separated on a 1.2%
agarose gel. The separated NP gene fragment was extracted from the
agarose gel, ethanol-precipitated and dissolved in 20 µl of distilled
H2O.
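
Both primers carry an NcoI recognition site (CCATGG), which is what allows the amplified fragment to be cut with NcoI in Example IV. A minimal sketch for locating that site is shown below. The helper name is hypothetical, and the final base of JLS90-46 is taken as C, an assumption based on the 3' end of the TSWV-PCR sequence listed later in this document.

```python
NCOI_SITE = "CCATGG"  # recognition sequence of the restriction enzyme NcoI

# Primer sequences written 5'->3' (last base of JLS90-46 assumed to be C).
JLS90_46 = "AGCTAACCATGGTTAAGCTCACTAAGGAAAGC"
JLS90_47 = "AGCATTCCATGGTTAACACACTAAGCAAGCAC"


def site_positions(seq, site=NCOI_SITE):
    """Return the 0-based start of every occurrence of site in seq."""
    return [
        i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)
    ]


sites_46 = site_positions(JLS90_46)
sites_47 = site_positions(JLS90_47)
```

In each primer the single site sits after a six-nucleotide leader. In JLS90-46 the ATG embedded in CCATGG reads directly into the coding sequence (ATG GTT AAG CTC ...), and the 20 nucleotides that follow the site in JLS90-47 match the span quoted for positions 1919 to 1938.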

EXAMPLE IV 

Construction of plant expression and transformation vectors:

The gel-isolated NP gene fragment from Example III was digested
with the restriction enzyme NcoI in 50 µl of a reaction buffer [50 mM
Tris-HCl (pH 8.0), 10 mM MgCl2, 0.1 M NaCl] at 37°C for 3 hours, and
directly cloned into NcoI-digested plant expression vector pBI525. The
resulting plasmids were identified and designated as pBI525-NP+ in
the sense orientation relative to the cauliflower mosaic virus (CaMV)
35S promoter, and as pBI525-NP- in the reverse orientation. The
ability of this expression cassette to produce the NP was determined by
transient expression of the NP gene in Nicotiana tabacum protoplasts,
as described by Pang et al. [see Gene 112:229 (1992)]. The expression
cassette containing the NP gene was then excised from pBI525-NP+ by
a partial digestion with HindIII/EcoRI (since the NP gene contains
internal HindIII and EcoRI sites), and ligated into the plant
transformation vector pBIN19 (Clontech Laboratories, Inc.) that had
been cut with the same enzymes. The resulting vector, pBIN19-NP+, and
the control plasmid pBIN19 were transferred to A. tumefaciens strain
LBA4404, using the procedure described by Holsters et al. [see Mol. Gen.
Genet. 163:181 (1978)].

The nucleotide sequences of the inserts in clones pTSWVS-23
and pBI525-NP+ were determined using the dideoxyribonucleotide
method, T7 polymerase (U.S. Biochemicals, Sequenase™), and the
double-stranded sequencing procedure described by Siemieniak et al.
[see Analyt. Biochem. 192:441 (1991)]. Nucleotide sequences were
determined from both DNA strands and this information was compared
with the published sequence of TSWV isolate CPNH1 using computer
programs available from the Genetics Computer Group (GCG, Madison,
WI).

Plasmids for transient expression of the NP gene in tobacco protoplasts were
also prepared. Plasmid DNAs for clones pTSWVS-23 and pUC18cpphas
TSWV-NP (containing the PCR-engineered NP gene insert) were isolated
using the large-scale alkaline method. The PCR-engineered NP gene
insert was excised from clone pBI525-NP+ by NcoI digestion to take
advantage of the available flanking oligomer primers for sequencing.
The expression cassette pUC18cpphas is similar to pUC18cpexp except
that it utilizes the poly(A) addition signal derived from the Phaseolus
vulgaris seed storage gene phaseolin. These plasmid DNAs were
subjected to two CsCl-ethidium bromide gradient bandings, using a
Beckman Ti 70.1 fixed angle rotor. DNA sequences were obtained using
dideoxyribonucleotides and the double-stranded plasmid DNA sequencing
procedure described above. Nucleotide sequence reactions were
electrophoresed on one-meter long thermostated (55°C) sequencing gels
and nucleotide sequence readings averaging about 750 bp were obtained.
Nucleotide sequences were determined from both DNA strands of both
cloned inserts to ensure accuracy. Nucleotide sequence information
from the TSWV-BL S RNA isolate was compared, as discussed below,
with TSWV isolates CPNH1 and L3 using computer programs (GCG,
Madison, WI).

The nucleotide and deduced amino acid sequences of the cloned cDNA
and PCR-engineered insert of TSWV-BL S RNA and their comparison
with the nucleotide sequence of TSWV-CPNH1 S RNA are shown below.
The nucleotide sequences of the TSWV-BL S RNA clones pTSWVS-23
(TSWV-23) and pBI525-NP+ (TSWV-PCR) were obtained using the
double-stranded dideoxynucleotide sequencing procedure of Siemieniak,
and their sequences are compared with the relevant regions of the
nucleotide sequence of the TSWV-CPNH1 S RNA reported in GenBank
Accession No. D00645. The nucleotide sequence of TSWV-CPNH1 S RNA
has been reported by De Haan (1990) and is represented by the following
sequence:

CAAGTTGAAA GCAACAACAG AACTGTAAAT TCTCTTGCAG TGAAATCTCT 50 
GCTCAtK3TCA GCAGAAAACA ACATCATQCC TAACTCTCAA QCTICCACTG 100 
ATTX7TCATTT CAAGCTGAQC CTCTGGCIAA GGGTTCCAAA GGTTTTGAAG 150
CAGG1TICCA. TTCAGAAATT GTKAAGGIT GCAGGAGATG AAACAAACAA 200 
AACATTTIAT TTATUEAITG CCTGCA1TCC AMCCATAAC AGTGTTGAGA 250 
CAGCTTTAAA CATTACTGTT ATTTGCAAGC ATCAGCTCCC AATTCGCAAA 300 
TGCAAAGCTC CmTGAATT ATCAATGATG TTTTCTGA.TT TAAAGGAGOC 350 

TTACAACATT GTTCATGACC CTICAIACCC CAAAGGATCG GTTOCAATGC 400

TCTQGCTCGA AACTCACACA TCTTIGCACA AGTTCTTTGC AACTAACTIG 450 
CAAGAAGATG TAATCATCTA CACTTTGAAC AACCTTGAGC TAACTCCTGG 500 
AAAGTTAGAT TTAGGTGAAA GAACCTTGAA TTACAGTGAA GATGCCTACA 550 
AAAGGAAAIA TTKXTTTCA AAAACACTTG AATGTCTTCC ATCTAACACA 600 

CAAACTATGT CTTACTIAGA CAGCATCCAA ATCCCTTCAT GGAAGATAGA 650

CTTIGCCAGA GGAGAAATTA AAATTTCTCC ACAATCTATT TCAGTTGCAA 700 
AATCTTTCTT AAAGCTTGAT TTAAQCGQGA TCAAAAAGAA AGAATCTAAG 750 
GTTAAGGAAG CGTATGCTTC AGGATCAAAA TAATCTTGCT TTGTCCAGCT 800 
TTTTCTAAIT ATGTEATGTT TAITI'ILTIT CTTEACTTAT AATTATITCT 850 

CTGTITGTCA TCTCTTTCAA ATTCCTCCTG TCTAGTAGAA ACCAIAAAAA 900

CAAAAAATAA aaatgaaaat aaaattaaaa taaaataaaa TCAAAAAATC 1000 
AAATAAAAAC AACAAAAAAT TAAAAAACGA AAAACCAAAA AGAOCCGAAA 1050 
GGGACCAATT TGGCCAAATT TGGGTTITGT TTITGTTTTT TGTTTTTTGT 1100 
TTTTTATTIT TTATTTTA3T TTTATTTTAT TTEATTTTTA TlTilAl'lTlT 1150 

AlTimTTTA ITITrTGTIT TCGTTGTTTT TGITA1TTTA TTATTTATTA 1200

AQCACAACAC ACAGAAAGCA AACTTEAATT AAACACACTT AJTTAAAATr 1250 
mCACAm.AGCAA(X»CA AGCAAIAAAG ATAAAGAAAG CTTTATAIAT 1300 
TTATAGGCTT T1T1ATAATT TAACTTACAG CTGCTTTCAA GCAAGTTCTG 1350 
CGAGTnTGC CTGCTITTTA ACCCCGAACA TTDCATAGAA CTTGTTAAGA 1400 
GTTTCACTGT AATGTTCCAT AGCAACACTC CCTTEAGCAT TAGGATTGCT 1450

GGAGCTAAGT ATAGCAGCAT ACTCTTTOCC CTTCTTCACC TGATCTTCAT 1500 

TCATTTCAAA TGCTITGCIT TTCAGCACAG TGGAAACTIT TCCTAAGGCT 1550 

TCCTTGGTGT CATACTTCTT TGGGTCGATC COGAQGTCCT TGTA1TTTGC 1600 

ATCCTGATAT ATAGCCAAGA CAACACTGAT CATCTCAAAG CTATCAACTG 1650 

AAGCAATAAG AGGTAAGCTA CCTCOCAQCA TTATQGGAAG TCTCACAGAC 1700

TTTGCATCAT OGAGAGGTAA TCCATAGGCT TGAATCAAAG GATGGGAAGC 1750 

AATCTTAGAT TIGATAGTAT TGAGATTCTC AGAAITOCCA GTITCTTC1AA 1800 

C^QCCIGAC CCTGATCAAG CTATCAAGCC TTCTGAAGGT CATGTCAGTG 1850 

CCTGCAATOC TGTCTGAAGT TTTCTTTATG GTAATTTEAC CAAAAGTAAA 1900 

ATCGCTTTGC TTAATAACCT TCATTATGCT CTGAOGATTC TTEAGGAATG 1950

TCAGACATGA AATAACGCTC ATCTTCTTGA TCTGGTOGAT GTnTCCAGA 2000 

CAAAAAGTCT TGAAGTTGAA TGCTACCAGA TTCTGATCTT OCTCAAACTC 2050 

AAGGTCTTTG CCTTGTGTCA ACAAAQCAAC AAIGCmCC..ITAG^GAGCT 2100 

TAACC.TTaGA CATGATGATC GTAAAAGTTG TTATAGCTTT GACCGTATGT 2150 

AACTCAAGGT GCGAAAGTGC AACTCTGTAT CCCGCAGTCG TTTCTTAGGT 2200

TCTTAATGTG ATGATTTGTA AGACTGAGTG TIAACGTATG AACACAAAAT 2250 
TGACACGATT GCTCT 2265 

The incomplete deduced amino acid sequence of the nonstructural
protein gene on TSWV-CPNH1 S RNA is provided below, beginning with
the nucleic acid at position 1 and ending with the codon ending
at position 783:
Glu Leu Ser Met Met Phe Ser Asp Leu Lys Glu Pro Tyr Asn Ile
                110                 115                 120

Val His Asp Pro Ser Tyr Pro Lys Gly Ser Val Pro Met Leu Trp
                125                 130                 135

Leu Glu Thr His Thr Ser Leu His Lys Phe Phe Ala Thr Asn Leu
                140                 145                 150

Gln Glu Asp Val Ile Ile Tyr Thr Leu Asn Asn Leu Glu Leu Thr
                155                 160                 165

Pro Gly Lys Leu Asp Leu Gly Glu Arg Thr Leu Asn Tyr Ser Glu
                170                 175                 180

Asp Ala Tyr Lys Arg Asp Tyr Phe Leu Ser Lys Thr Leu Glu Cys
                185                 190                 195

Leu Pro Ser Asn Thr Gln Thr Met Ser Tyr Leu Asp Ser Ile Gln
                200                 205                 210

Ile Pro Ser Trp Lys Ile Asp Phe Ala Arg Gly Glu Ile Lys Ile
                215                 220                 225

Ser Pro Gln Ser Ile Ser Val Ala Lys Ser Leu Leu Lys Leu Asp
                230                 235                 240

Leu Ser Gly Ile Lys Lys Lys Glu Ser Lys Val Lys Glu Ala Tyr
                245                 250                 255

Ala Ser Gly Ser Lys
                260
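
The deduced residues can be cross-checked against the nucleotide listing by translating with the standard genetic code. The sketch below is an illustration only: it translates nucleotides 496 to 540 of the TSWV-CPNH1 S RNA sequence as read from the listing above (codons 166 to 180), uses one-letter amino acid codes, and its function and variable names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons ordered TTT, TTC, TTA, TTG, TCT, ...
# ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA sequence in frame 1; trailing bases are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 496-540 of the TSWV-CPNH1 listing (codons 166-180).
SEGMENT = "CCTGGAAAGTTAGATTTAGGTGAAAGAACCTTGAATTACAGTGAA"

peptide = translate(SEGMENT)
```

The result, PGKLDLGERTLNYSE, corresponds to Pro Gly Lys Leu Asp Leu Gly Glu Arg Thr Leu Asn Tyr Ser Glu, the row of the listing that ends at residue 180.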

The nucleotide sequence for TSWV-23 depicted below compares
closely with the TSWV sequence given above, and contains one half of
the nonstructural gene and one half of the nucleocapsid protein gene.

AAATTCTCTT GCAGTGAAAT CTCTGCTCAT GTTAGCAGAA AACAACATCA 50 
TGCCTAACTC TCMGCTTTT GTCAAAGCTT CTACTGATTC TAATTTCAAG 100 
CTGAGCCTCT GGCTAAGGGT TCCAAAGGTT TTGAAGCAGA TTTCCATTCA 150 
GAAATTGTTC AAGGTTGCAG GAGATGAAAC AAATAAAACA TTTTATTTAT 200 
CTATTGCCTG CATTCCAAAC CATAACAGTG TTGAGACAGC TTTAAACATT 250 
ACTGTTATrT GCAAGCATCA GCTOCCAATT CGTAAATGTA AAACTCCTTT 300 
TGAATTATCA ATGATGTTTT CTGATTTAAA GGAGOCTTAC AACATTATTC 350 
ATGATCCTTC ATATCCCCAA AGGATTGTTC ATGCTCTGCT TGAAACTCAC 400 
ACATCTTTTG CACAAGTTCT TTGCAACAAC TTGCAAGAAG ATGTGATCAT 450 
CTACACCTTG AACAAOCATG AGCTAACTCC TGGAMGTTA GATTTAGGTG 500 
AAATAACTTT GAATTACAAT GAAGACGCCT ACAAAAGGAA ATATTTCCTT 550 
TCAAAAACAC TTGAATGTCT TCCATCTAAC ATACAAACTA TGTCTTATTT 600 
AGACAGCATC CAAATOCCTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA 650 
TLAAAATTTC TCCACAATCT A1TTCAGTTG CAAAATCTTT GTTAAATCTT 700 
GATTTAAGCG GGATTAAAAA GAAAGAATCT AAGATLAAGG AAGCATATGC 750 
TTCAGGATCA AAATGATCTT GCTGTGTCCA GCTTTTTCTA ATTATGTTAT 800 
GTTTATTTTC TTTCTTTACT TATAATTATT T1TCTGTTTG TCA1TTCTTT 850 
CAAATTCCTC CTGTCTAGTA GAAACCATAA. AAACAAAAAT AAAAATAAAA 900 
TAAAATCAAA ATAAAATAAA AATCAAAAAA TGAAAIAAAA GCAACAAAAA 950
AATIAAAAAA CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGCCA 1000 
AATTTGGGGT TTGTrtTTGT TTTITGTTTT TITGTTTTTT GTTITTATTT 1050 
TTATTTTEAT TTTTAITTTT AlTriAlTlT ATTTEATGTT TTTGITGTTr 1100 

TTGTTATTTT GTTATTTATT AAGCACAACA CACAGAAAGC AAACTTTAAT 1150

TAAACAGACT TATTTAAAAT TTAACACACT AAGCAAGCACA AACAATAAA 1200 
GAIAAAGAAA GC1T1ATATA TTTATAGGCT TTTTEAIAAT TTAACTTACA 1250 
GOGCITTTA AGCAAGTTCT GTGAC^TrTTG CCTGTnTTT AACCCCAAAC 1300 
ATITCATAGA ACTTGTTAAG GGTTTCACTG TAATGITOCA TAQCAATACT 1350 
TCCTTEAGCA TTAGGATTGC TGGAGCTAAG TAIAGCAQCA TACTCTITCC 1400
CCTTCTTCAC CTGATCITCA TTCATTTCAA ATGCTTTICT TTTCAQCACA 1450 
GTGCAAACTT TTCCTAAGGC TTCC3CTGGTG TCATACTTCT TTGGGTCGAT 1500 
CCCGAGATCC TTGTATTTTG CATCCTGATA TATAGCCAAG ACAACACTGA 1550 
TCATCTCAAA GCTATCAACT GAAGCAATAA GAGGTAAGCT ACCTCCCAGC 1600 

ATIATGGCAA GCCTCACAGA CTITGCATCA TCAAGAGGTA ATOCATAGGC 1650

TTGAATCAAA GGGTGGGAAG CAATCTTAGA TTTGATAGTA TTGAGATTCT 1700 
CAGAATTCC 1709 

The nucleic acid sequence for TSWV-PCR according to the present
invention as depicted below also compares closely with the TSWV
sequence given above and covers the whole nucleocapsid protein gene.

TTAACACACT AAGCAAGCAC AAACAATAAA GAIAAAGAAA GCTTTATATA 50 
TTTATAGGCT TTTTTATAAT TTAACTTACA GCTGCTTTEA AGCAAGTTCT 100 
GTGAGTTTTG CCTGTTTTTT AACCCCAAAC ATTTCATAGA ACTTGTTAAG 150 
GGTTTCACTG TAATGTTCCA TAGCAATACT TCCTTTAGCA TIAGGATTGC 200 

TGGAGCTAAG TATAGCAGCA TACTCTTTCC CCTTCTTCAC CTGATCTTCA 250

TTCATTTCAA ATQCTTTTCT TTTCAGCACA GTGCAAACTT TTCCTAAGGC 300 
TICCCTGGTG TCATACTTCT TIGGGTCGAT CCCGAGATCC TTGTATTTTG 350 
CATCCTGATA TATAGCCAAG ACAACACTGA TCATCTCAAA GCTATCAACT 400 
GAAGCAATAA GAGGTAAGCT ACCTCCCAGC ATIATGGCAA GCCTCACAGA 450 
CTITGCATCA TCAAGAGGTA ATCCATAGGC TTGACTCAAA GGGTGGGAAG 500
CAATCTTAGA TTTGATAGTA TTGAGATTCT CAGAATTCCC AGTTTCCTCA 550 
ACAAGCCTGA CCCTGATCAA GCTATCAAGC CTTCTGAAGG TCATGTCAGT 600 
GGCtCCAATC CTGTCTGAAG TTTICTTTAT GGTAATTTTA CCAAAAGTAA 650 
AATCGCTTTG CTTAATAAOC TTCATTATGC TCTGACGATT CTTCAGGAAT 700 

GTCAGACATG AAATAATGCT CATCTTTTTG ATCTGGTCAA GGTTTTCCAG 750

ACAAAAAGTC TTGAAGTTGA ATGCTACCAG ATTCTGATCT TCCTCAAACT 800
CAAGGTCTTT GCCTTGTGTC AACAAAGCAA CAATGCTTTC CTTAGTGAGC 850 
TTAAOCAT 858 

The cloned TSWV-23 insert overlaps the TSWV-PCR
insert, and together they represent 2028 nucleotides of the
TSWV-BL S RNA according to the present invention. This 2028 nucleotide
sequence according to the present invention contains a part of the
nonstructural gene and the whole nucleocapsid protein gene. The combined
sequence is:

AAATTCTCTT GCAGTGAAAT CTCIGCTCAT GTTAQCAGAA AACAACATCA 50
TGCGTAACTC TCAAGLTl'iT GTCAAAGCTT CTACTGATTC TAATTTCAAG 100 
CTGAGCCTCT GGCTAAQGGT TCCAAAGGTT TTGAAGCAGA TITCCATICA 150 
GAAATTGTTC AAGGTTGCAG GAGATGAAAC AAAIAAAACA TTTTATTTAT 200 
CTATTGCCTG CAITCCAAAC CATAACAGTG TTGAGACAGC TTEAAACATT 250 

ACTGTEATTT GCAAGCATCA GCTCCCAATT CGTAAATGTA AAACTCCTTT 300

TGAAJTATCA ATGATbTl'lT CTGATTTAAA GGAGCCTTAC AACATTATTC 350 
ATGATCCTTC ATATGCOCAA AGGATTGTTC ATGCTCTGCT TGAAACTCAC 400 
ACATCITITG CACAAGTTCT TTGCAACAAC TTGCAAGAAG ATGTGATCAT 450 
CmCACCTTG AACAACCATG AQCTAACTCC TGGAAAGTEA GATITAGGTG 500 

AAATAACTTT GAATTACAAT GAAGACGCCT ACAAAAGGAA ATA3TTCCIT 550

TCAAAAACAC TTGAATGTCT TCCATCTAAC ATACAAACTA TGTCTTATTT 600 
AGACAGCATC CAAATCCCTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA 650 
TTAAAATITC TOCACAATCT ATTTCAGTrG CAAAATCTTT GTTAAATCTT 700 
GATITAAGCG GGATEAAAAA GAAAGAATCT AAGATIAAGG AAGCATATGC 750 

TTCAGGATCA AAATGATCTT GCTGTGTCCA GCTTTTTCTA ATTATGTTAT 800

GTnAiTITC TTTCITEACT TATAATTATT TTTCTGTTTG TCATTTCTTT 850 
CAAATTCCTC CTGTCTAGTA GAAACCATAA AAACAAAAAT AAAAATAAAA 900 
TAAAATGAAA ATAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA 950 
AAITAAAAAA CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGOCA 1000 

AATITGGGGT TTGTTTITGT TITTTGTTTT TTTGTTTTIT GTTTITATIT 1050

TTAlTiTlAT 1TITA1TTIT AlTlTATTET ATTTTATGTT TITGTTGTIT 1100 
TIGTTATTTT GTIATTTATT AAGCACAACA CACAGAAAGC AAACTTTAAT 1150 
TAAACACACT TATTTAAAAT TTAACACACT AAGCAAGCAC AAACAATAAA 1200 
GATAAAGAAA GCTTTATATA TTTATAGGCT TTTTTATAAT TTAACTTACA 1250 
GCTGCTTTTA AGCAAGTTCT GTGAGTTTTG CCTblTiTlT AACCCCAAAC 1300
AITICATAGA ACTTGTTAAG GGTTTCACTG TAATGTTCCA TAGCAATACT 1350
TOCfTTAGCA TTAGGATTGC TGGAGCTAAG TATAGCAQCA TACTCTTTCC 1400 
aTTCrTCAC CIGATCTTCA TTCATTTCAA ATGCTTTTCT TTTCAGCACA 1450 
GTGCAAACTr TTCCTAAGGC TTCCCTGGTG TCATACTICT TTGGGTCGA:: 1500 
COCGAQATCC TlGlM*tiTG CATCCTGATA TATAQCCAAG ACAACACTGA 1550 
TCAiCTCAAA GCTATCAACT GAAGCAAIAA GAGGTAAQCT ACCTCCCAGC 1600 
ATEATOGCAA GOCTCACAGA CTITQCATCA TCAAGAGGTA ATCCATAGGC 1650 
TTGACTCAAA, GGGTGGGAAG CAATCTTAGA TTTGATAGTA TTGAGATTCT 1700 
CAGAATTCCC AGTTICCTCA ACAAGCCTGA CCCTGATCAA GCEATCAAGC 1750 
CTTCTGAAGG TCATGTCAGT GGCTCCAATC CTGTCTGAAG TTTTCTnAT 1800 
GGTAAiTTTA CCAAAAG1AA AATCGCTTTG CTEAAIAACC TTCATTATGC 1850 
TCTGACGATT CTTCAGGAAT GTCAGACATG AAATAATGCT CATCTITTTG 1900 
ATCTGGTCAA GGTTITCCAG ACAAAAAGTC TTGAAGTTGA ATGCTACCAG 1950 
ATTCTGATCT TOCTCAAACT CAAGGTCTTT GCCTIGTGTC AACAAAGCAA 2000 
CAATGCnrC CTrAGTGAGC TTAACCAT 2028 
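
The way two overlapping inserts combine into one sequence can be sketched as a suffix/prefix merge. The code below is an illustration with short toy strings, not the actual inserts, and it is not the procedure used to assemble the listing above. It requires an exact match in the overlap, which the real clones would not satisfy because they differ at one position there. The function name and the minimum overlap length are hypothetical.

```python
def merge_overlapping(left, right, min_overlap=10):
    """Join two sequences on their longest exact suffix/prefix overlap.

    Returns the merged sequence and the overlap length.
    """
    for size in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return left + right[size:], size
    raise ValueError("no exact overlap of the required length")


# Toy fragments for illustration only.
left = "AAATTCTCTTGCAGTGAAATCTCTGCTCATG"
right = "GAAATCTCTGCTCATGTTAGCAGAAAACAAC"

merged, overlap = merge_overlapping(left, right)
```

The same bookkeeping applies to the lengths reported here: inserts of 1709 and 858 nucleotides that combine into 2028 nucleotides share 1709 + 858 - 2028 = 539 nucleotides in total.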

This comparison showed that the cDNA insert of clone pTSWVS-23
included about 760 bp of the 52K protein viral component gene, the
complete intergenic region (492 bp), and 450 bp of the NP gene (about
half of the NP gene). This cloned insert had its 3'-end located exactly
at an EcoRI recognition site, which suggested incomplete EcoRI
methylation during the cDNA cloning procedure. Although this clone did
not contain the complete TSWV-BL NP gene, its sequence was of
considerable importance since it had a 450 bp overlap with the
sequence of the PCR-engineered NP gene (a total of 2028 bp of the
TSWV-BL S RNA is presented in the nucleotide sequence for TSWV). The
sequence comparison between the TSWV-BL PCR-engineered and
TSWV-CPNH1 NP genes revealed a total of 21 nucleotide differences (2.7%),
eight of which encode amino acid replacements (3.1%). Since this
PCR-engineered NP gene was obtained using Taq polymerase, which is known
to incorporate mutations, it is possible that some of these differences
were introduced during PCR amplification. However, 15 of these
nucleotide differences were located within the overlapping region
between the TSWV-BL cDNA and PCR clones, and all but one of these
nucleotide differences (position 1702 of TSWV; position 485 of
TSWV-PCR) are shared by both TSWV-BL S RNA derived clones. This
comparison clearly showed that the PCR amplification did not
contribute greatly, if at all, to the difference between the nucleotide
sequences of these two cloned NP gene regions. The nucleotide
difference at position 1702 resulted in the amino acid replacement of
Ile with Ser, and even this difference could be due to the lack of
homogeneity within the TSWV-BL isolate.
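
The percentages quoted for the NP gene comparison can be reproduced with simple arithmetic. The denominators are not given in the text, so the sketch below assumes a coding region of 777 nucleotides (258 codons plus a stop codon). That figure is an inference chosen because it is consistent with both percentages, not a value stated in this document, and the names used are hypothetical.

```python
# Assumed sizes (not stated in the text): 258 codons plus one stop codon.
NP_CODONS = 258
NP_ORF_NT = (NP_CODONS + 1) * 3  # 777 nucleotides


def percent(part, whole):
    """Percentage of whole represented by part, rounded to one decimal."""
    return round(100.0 * part / whole, 1)


nt_difference_pct = percent(21, NP_ORF_NT)  # 21 nucleotide differences
aa_difference_pct = percent(8, NP_CODONS)   # 8 amino acid replacements

# Of the 15 differences in the overlap, all but one are shared by both clones.
shared_in_overlap = 15 - 1
```

Under this assumption the two values come out at 2.7 and 3.1, matching the figures in the text, and 14 of the 15 differences in the overlapping region are common to the cDNA and PCR clones.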

EXAMPLE V 

Agrobacterium-mediated transformation:

Leaf discs of Nicotiana tabacum var. Havana cv. 423 were
inoculated with the Agrobacterium strain LBA4404 (ClonTech)
containing the vector pBIN19-NP+ or the control plasmid pBIN19, by
soaking overnight in a liquid culture of the Agrobacterium, and the
inoculated leaf discs were incubated on non-selective MS medium for 3
days [see Science 227:1229 (1985)]. Transformed cells were selected
and regenerated in MS medium containing 300 µg/ml kanamycin and 500
µg/ml carbenicillin for shoot regeneration. Roots were induced after
transfer of plantlets to hormone-free medium. Rooted transformants
were transferred to soil and grown under greenhouse conditions. The MS
medium contains full strength MS salt (Sigma), 30 g/l sucrose, 1 mg/l
BA and 1 ml of B5 vitamins [1 mg/ml nicotinic acid, 10 mg/ml thiamine
(HCl), 1 mg/ml pyridoxine (HCl), 100 mg/ml myo-inositol]. Transgenic
plants were self-pollinated and seeds were selectively germinated on
kanamycin medium.

EXAMPLE VI 

Serological detection of proteins:

Double antibody sandwich enzyme-linked immunosorbent assay
(DAS-ELISA) was used to detect the expression of the NP gene in
transgenic plants with polyclonal antibodies against the TSWV-BL NP.
Each sample was prepared by grinding a leaf disc (about 0.05 g) from the
second leaf from the top of the plant in 3 ml of an enzyme conjugate
buffer [phosphate-buffered saline, 0.05% Tween 20, 2%
polyvinylpyrrolidone 40, and 0.2% ovalbumin]. For tobacco protoplasts,
the cell extracts after centrifugation were directly used for the assay.
Ten- and three-fold dilutions of the samples from both transgenic plants
and tobacco protoplasts were made just before DAS-ELISA.

For Western blots, a leaf disc (about 0.05 g) was ground in 0.25
ml of 2X SDS/sample buffer (0.126 M Tris buffer, 20% glycerol, 2% SDS,
2% 2-mercaptoethanol, and 0.01 mg/ml bromphenol blue). The
homogenates were centrifuged and the supernatants were boiled before
loading. Proteins (10-20 µl sample/lane) were separated and blotted
onto a membrane. The membrane was then processed following the
manufacturer's Immunoselect kit instruction manual (Gibco BRL Life
Technologies Inc.). Antibodies to the whole virion were preabsorbed
with cell extracts from healthy tobacco plants [see Plant Disease 70:501
(1986)], and were used in Western blot at a concentration of 2 µg/ml.

Serological reactions of TSWV isolates (TSWV-BL, Arkansas,
10W pakchoy, Begonia or Brazil) were assayed in DAS-ELISA using
antibodies raised against TSWV-BL virion, or the NP of TSWV-BL or
TSWV-I.

EXAMPLE VII

Inoculation of transgenic plants with TSWV isolates:

Inocula were prepared by infecting Nicotiana benthamiana Domin.
with different TSWV isolates and grinding infected leaves (0.5 g) of N.
benthamiana plants (1 to 2 weeks after inoculation) in 15 ml of a
buffer (0.033 M KH2PO4, 0.067 M K2HPO4 and 0.01 M Na2SO3). The
inoculum extracts were immediately rubbed on corundum-dusted leaves
of transgenic plants and the inoculated leaves were subsequently rinsed
with H2O. Because TSWV is highly unstable in vitro after grinding, each
batch of inoculum was used to first inoculate NP(+) plants containing
the NP gene; the last inoculated plants of each inoculum were always
control NP(-) plants containing the vector sequence alone, to ensure
that a particular virus inoculum was still infective at the end of
inoculation.

Data on local lesions and systemic infections were taken 7-15
days after inoculation and expressed in the following table as the
number of plants systemically infected over the number of plants
inoculated, except where indicated. In this table, the data collected
under "ELISA" is the data of R0 lines from which the R1 plants were
derived; the Begonia isolate induced local lesions on the R1 plants, and
the resistance was expressed as the number of plants producing local
lesions over the number of plants inoculated; and NT indicates that
there was no test.

Reactions of R1 plants expressing the nucleocapsid protein (NP) gene of
tomato spotted wilt virus (TSWV) to inoculation with TSWV isolates.

                        Reactions to TSWV isolates
           ELISA
R0 line    (R0 pl.)   BL       Arkansas   10W pakchoy   Begonia   Brazil

NP(+)2     0.015      0/20     4/25       3/24          29/40     36/36
NP(+)4     0.386      6/30     21/23      18/21         9/48      42/42
NP(+)9     0.327      0/20     NT         20/20
NP(+)14    0.040      0/20                9/20          8/18      18/18
NP(+)21    0.042      0/15     5/15       3/15          2/4       6/6
NP(+)22    0.142      0/20                15/20         31/36     36/36
NP(+)23    0.317      0/20                16/20
NP(-)                 42/42    24/24      62/62         66/66     54/54


As described above, the isolation of the TSWV-BL NP gene, which
resides in the S RNA component of TSWV, was approached using two
strategies. The cDNA cloning strategy yielded several clones containing
cDNA inserts derived from TSWV-BL S RNA, as identified by
hybridization against an oligomer probe complementary to the TSWV-CPNH1
S RNA. Clone pTSWVS-23 contained the longest insert, which
mapped at about 1.7 kb in length. The second strategy utilized the
published sequence of TSWV-CPNH1 S RNA and PCR to amplify and
engineer the NP gene for expression directly from total TSWV-BL RNA.
Oligomer primers JLS90-46 and -47 were synthesized, with JLS90-46
being complementary to the S RNA in the 5'-coding region of the NP gene
(positions 2051-2073 of the TSWV-CPNH1) while JLS90-47 corresponds to
the 3'-noncoding region of the NP gene (positions 1218 to 1237 of the
TSWV-CPNH1). Both of the primers contain the recognition site for the
restriction enzyme NcoI for subsequent cloning, and the primer JLS90-46
has a plant consensus translation initiation codon sequence
(AAXXATGG), which upon amplification was expected to fuse the
translation initiation codon to the third codon (GTT) of the NP gene.
Fusion of the translation initiation codon to the third codon of the
TSWV-BL NP gene was done to preserve the NcoI recognition site while
not incorporating any amino acid codons. Thus, expression of the
PCR-engineered TSWV NP gene would yield a TSWV-BL NP that was two
amino acids (Ser-Lys) shorter at the N-terminus than the native NP.
This specifically amplified DNA fragment, of about 850 bp, was
digested with NcoI and cloned into the plant expression vector pBI525.
The orientation of the TSWV-BL NP gene with respect to the CaMV 35S
promoter was determined by restriction enzyme site mapping (EcoRI,
HindIII, AvaI and AlwNI). Several clones were isolated that contain the
insert in the proper orientation (pBI525-NP+), and others that contain
the insert in the opposite orientation (pBI525-NP-). This restriction
enzyme site mapping data also showed that the inserts of clones
pBI525-NP+ contained restriction enzyme sites that were identical to
those found in the TSWV-CPNH1 NP gene. The expression of the TSWV-BL
NP gene was thus controlled by a double CaMV 35S promoter fused to
the 5'-untranslated leader sequence of alfalfa mosaic virus (ALMV) of
the expression vector pBI525. Expression vectors that utilize the
stacked double CaMV 35S promoter elements yield higher levels of
mRNA transcription than similar vectors that utilize a single 35S
promoter element.

Three pBI525-NP+ clones were transiently expressed in tobacco
protoplasts to confirm that the amplified DNA fragment encoded the NP.
To achieve this, the clones were transferred into tobacco protoplasts
by the PEG method, and after two days of incubation the expressed NP
was detected by DAS-ELISA using antibodies against the whole TSWV-BL
virion. High levels of NP were produced in tobacco protoplasts
harboring the NP gene in plasmid pBI525-NP+, while no NP was
detected in tobacco protoplasts transformed with the antisense NP
sequence (pBI525-NP-).

As described previously, the PCR-engineered insert of clone
pBI525-NP+ and the cDNA insert of the clone pTSWVS-23 were subjected
to double stranded sequencing. The sequence analysis of the cDNA and
the PCR clones revealed inserts of 1.71 kb and 865 bp, respectively
which, when compared with the sequence of TSWV-CPNH1 S RNA, shows
that the cDNA insert of clone pTSWVS-23 includes about 760 bp of the
52 K protein viral component gene, the complete intergenic region
(492 bp), and 450 bp of the NP gene (about one-half of the gene). This
cloned insert has its 3'-end located exactly at an EcoRI recognition
site, suggesting incomplete EcoRI methylation during the cDNA cloning
procedure. Although this clone does not contain the complete TSWV-BL
NP gene, its sequence is of considerable importance since it has a 450
bp overlap with the sequence of the PCR-engineered NP gene. The
sequence comparison between the TSWV-BL PCR-engineered and TSWV-CPNH1
NP genes reveals a total of 21 nucleotide differences (2.7%),
eight of which encode amino acid replacements (3.1%). Since the
PCR-engineered NP gene was obtained using Taq polymerase, which is
known to incorporate mutations, it is possible that some of these
differences were introduced during PCR amplification. However, 15 of
these nucleotide differences are located within the overlapping region
between the TSWV-BL cDNA and PCR clones, and all but one of these
differences (position 1702) are present in both TSWV-BL S RNA derived
clones. This comparison clearly shows that the PCR amplification did
not contribute greatly, if at all, to the difference between the
nucleotide sequences of these two NP genes. The nucleotide difference
at position 1702 results in the amino acid replacement of Ile with Ser,
and even this difference could be due to the lack of homogeneity within
the TSWV-BL isolate.

The possibility that the nucleotide differences can be attributed
to divergence among the TSWV isolates is also supported by
comparisons with other sequenced regions among TSWV-CPNH1, TSWV-L3,
and TSWV-BL S RNAs. These comparisons are tabulated below:

Percent nucleotide and amino acid sequence differences for the comparison of TSWV S
RNA component from isolates CPNH1, L3 and BL a

              52 K Protein Gene                    Intergenic      NP Gene
Comparison    Nucleotide           Amino Acid      Nucleotide      Nucleotide      Amino Acid

CPNH1/L3      68/1396 b (4.9) c    49/464 (10.6)   46/511 (9.0)    24/777 (3.1)    4/258 (1.6)
CPNH1/BL      21/758 (4.1)         23/251 (9.2)    26/496 (5.2)    19/765 (2.5)    8/255 (3.1)
L3/BL         38/765 (5.0)         20/254 (7.9)    38/498 (7.6)    19/767 (2.5)    4/255 (1.6)

a Comparisons are made using the sequence information available from the
particular component region of TSWV-BL. The comparison for the TSWV-BL NP
gene includes the combined sequence information from the cDNA clone, pTSWVS-23,
and the PCR-engineered insert.
b Comparison numbers are total differences (nucleotides or amino acids)
divided by total number of positions (nucleotides or amino acids) compared. For
both nucleotide and amino acid calculations, gaps, regardless of length, were
counted as one mismatch.
c Numbers in parentheses are percentages.
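
The counting rule in footnote b can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the two sequences are assumed to be already aligned with "-" marking gaps, and a gap run is assumed to contribute one compared position as well as one mismatch (the source does not say how gapped positions enter the denominator).

```python
def count_differences(seq_a, seq_b, gap="-"):
    """Count differences between two aligned sequences of equal length.

    A run of gap characters, whatever its length, is scored as a single
    mismatch and (by assumption) a single compared position.
    Returns (differences, positions_compared).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    differences = 0
    positions = 0
    in_gap = False
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            if not in_gap:          # first column of a gap run
                differences += 1
                positions += 1
            in_gap = True
        else:
            in_gap = False
            positions += 1
            if a != b:
                differences += 1
    return differences, positions


def percent(differences, positions):
    """Percentage difference rounded to one decimal, as in the table."""
    return round(100.0 * differences / positions, 1)


if __name__ == "__main__":
    # Toy alignment: one substitution plus one three-column gap run.
    d, n = count_differences("ACGTACGTAC", "ACCT---TAC")
    print(d, n, percent(d, n))
    # Ratio printed for the CPNH1/L3 52 K gene nucleotide comparison.
    print(percent(68, 1396))
```

Applying `percent` to the printed ratios is a quick way to check a table cell against its parenthesized value.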
The nucleotide sequences of the NP genes from the CPNH1 and L3
isolates differ from each other by 3.1% and from the BL isolate by
nearly a similar degree (2.5%). However, the NP amino acid sequences
between CPNH1 and BL isolates differ by a considerably larger amount
(3.1%) than they differ between the L3 and BL or CPNH1 and L3 isolates
(1.6%). The results tabulated above also reveal that the NP gene region
of these TSWV isolates is subject to a higher degree of selective
pressure than the 52 K protein, as the differences among the amino acid
sequences of the 52 K protein range from 7.9 to 10.6%, more than twice
that found for the amino acid sequence of the NPs. Nucleotide sequence
divergence is highest among the intergenic regions, indicating that this
region is subject to less selective pressure than either genetic region.

The presence of NP gene sequences in transgenic plants was first
confirmed by PCR analysis. An NP DNA fragment of about 800 bp was
specifically amplified from the total DNAs of transgenic NP(+) plants
using the primers homologous to sequences flanking the NP gene,
whereas no corresponding fragment was detected in control NP(-)
plants. Expression of the NP gene was assayed in each R0 transgenic
plant by DAS-ELISA, and the results are presented in the following
table:

Reactions of R0 transgenic plants expressing the nucleocapsid protein (NP) gene of
tomato spotted wilt virus (TSWV) to inoculation with TSWV-BL isolate

plant age       R0 clone    ELISA a    Lesions/leaf b    NP(+):NP(-) c

7-8 leaves:     NP(+)1      0.374      7 (199)           1:28
                NP(+)2      0.015      0 (199)           0:199
                NP(+)3      0.407      23 (102)          1:4
                NP(+)4      0.386      2 (102)           1:51
                NP(+)5      0.023      0 (124)           0:124
                NP(+)6      0.197      35 (325)          1:9
                NP(+)7      0.124      1 (325)           1:325
9-10 leaves:    NP(+)8      0.344      36 (36)           1:1
                NP(+)9      0.327      2 (20)            1:10
                NP(+)10     0.406      34 (33)           1:1
                NP(+)11     0.156      5 (20)            1:4
                NP(+)12     0.133      9 (57)            1:6
                NP(+)13     0.144      2 (7)             1:4
                NP(+)14     0.040      0 (19)            0:19
                NP(+)16     0.053      0 (10)            0:10
5-6 leaves:     NP(+)20     0.487      203 (117)         2:1
                NP(+)21     0.042      0 (117)           0:117
                NP(+)22     0.142      0 (208)           0:208
                NP(+)23     0.317      223 (208)         1:1
                NP(+)24     0.051      0 (35)            0:35
                NP(+)25     0.286      13 (35)           1:3
                NP(+)26     0.037      0 (22)            0:22
                NP(+)27     0.425      305 (22)          14:1

a Production of the NP in transgenic plants was assayed by double antibody
sandwich enzyme-linked immunosorbent assay (DAS-ELISA); concentration of
antibodies against virion for coating: 1 µg/ml; dilution of conjugate to the NP of
TSWV-BL: 1:250; results taken 150 min. after adding substrate; readings at 405
nm.
b Local lesions that developed on inoculated leaves were counted 7 days after
inoculation. Data represent the average of three inoculated leaves. Data in
parentheses are the number of lesions produced from control NP(-) plants
inoculated with the same inoculum.
c The ratio of local lesions that developed on NP(+) plants transformed with
pBIN19-NP+ versus local lesions that developed on the control NP(-) plant when
inoculated with the same inoculum.

Of the 23 NP(+) clones, 10 produced high levels of NP, 5
accumulated intermediate levels of NP, and the remaining 8 produced
low levels of NP. The size of the NP expressed in transgenic plants was
analyzed using Western blot. Many polypeptides from tobacco extracts
were reactive to the antibodies against the whole virion even though the
antibodies were pre-absorbed with extracts from healthy tobacco
plants. Of those, only one band was unique to the pattern of
polypeptides from tobacco plants transformed with the NP gene. This
polypeptide was estimated to be around 29 kDa, which is near the
expected size of the native NP. No antibody-reactive protein band of
similar size was found in extracts from transgenic plants containing
the vector pBIN19.

Inoculation of tobacco leaves with TSWV-BL isolate could result
in either systemic infection or necrotic local lesions, depending upon
weather conditions and physiological stages of plants. When R0 plants
were tested with TSWV-BL for viral resistance, TSWV-BL induced
typical necrotic lesions on the inoculated leaves of control NP(-) plants
6-8 days after inoculation. However, transgenic NP(+) plants showed a
spectrum of resistance to the virus when compared to control NP(-)
plants. Eleven of the 23 NP(+) plants either did not develop any local
lesion or developed at least 20-fold fewer lesions than the
corresponding inoculated NP(-) plants. Three NP(+) plants
had intermediate reactions (5- to 19-fold fewer lesions than controls)
while the remaining 9 plants had low or no resistance. None of the
inoculated NP(+) or NP(-) plants showed systemic infection.
Symptomless R0 plants were monitored until the end of their life cycle,
and no symptom was observed throughout their life cycles. The
inoculated leaves of the symptomless NP(+) plants were checked for the
presence of the virus on the leaves of C. quinoa plants. No virus was
recovered from TSWV-BL-challenged leaves of highly resistant NP(+)
plants, suggesting that the virus could not replicate or spread in these
NP(+) plants.

Leaf discs from selected R0 plants were subcloned, and the
regenerated plantlets were challenged by the virus. All subcloned R0
plants displayed levels of resistance similar to their corresponding
original R0 plants.
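
The three resistance classes described above for the 23 R0 clones can be recomputed from the lesion counts in the R0 table. The sketch below is illustrative only: the function names and the exact cut-offs (zero lesions or a reduction of 20-fold or more as high, 5-fold up to 20-fold as intermediate, anything less as low) are a reading of the text, not a procedure stated in the source.

```python
# (clone, lesions per leaf on the NP(+) plant, lesions on the NP(-) control),
# copied from the R0 table.
R0_LESIONS = [
    ("NP(+)1", 7, 199), ("NP(+)2", 0, 199), ("NP(+)3", 23, 102),
    ("NP(+)4", 2, 102), ("NP(+)5", 0, 124), ("NP(+)6", 35, 325),
    ("NP(+)7", 1, 325), ("NP(+)8", 36, 36), ("NP(+)9", 2, 20),
    ("NP(+)10", 34, 33), ("NP(+)11", 5, 20), ("NP(+)12", 9, 57),
    ("NP(+)13", 2, 7), ("NP(+)14", 0, 19), ("NP(+)16", 0, 10),
    ("NP(+)20", 203, 117), ("NP(+)21", 0, 117), ("NP(+)22", 0, 208),
    ("NP(+)23", 223, 208), ("NP(+)24", 0, 35), ("NP(+)25", 13, 35),
    ("NP(+)26", 0, 22), ("NP(+)27", 305, 22),
]


def resistance_class(lesions, control_lesions):
    """Classify one clone by fold reduction in lesions versus its control."""
    if lesions == 0:
        return "high"
    fold = control_lesions / lesions
    if fold >= 20:
        return "high"
    if fold >= 5:
        return "intermediate"
    return "low"


def tally(rows):
    """Count clones per resistance class."""
    counts = {"high": 0, "intermediate": 0, "low": 0}
    for _clone, lesions, control in rows:
        counts[resistance_class(lesions, control)] += 1
    return counts


if __name__ == "__main__":
    print(tally(R0_LESIONS))
```

With these cut-offs the tally reproduces the 11, 3 and 9 plants reported in the text.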

Since TSWV is widespread and many biologically distant strains
exist, the effectiveness of the transgenic plants in resisting
infections by different TSWV isolates was also tested. Five TSWV
isolates were chosen in this study to challenge R1 plants germinated on
kanamycin-containing medium: TSWV-BL, Arkansas, 10W pakchoy, Begonia
and Brazil. The first three isolates were reactive to the antibodies
against the whole virion and the NP of TSWV-BL (the common TSWV "L"
serogroup) (see figure 5). The Begonia isolate reacted strongly to the
antibodies against the NP of TSWV-I (the "I" serogroup) but not to those
raised against the TSWV-BL NP, and therefore belonged to the "I"
serogroup. No detectable reaction of the Brazil isolate was found to the
antibodies against either the NP of the TSWV-BL or the TSWV-I
serogroup, and it was weakly reactive to the antibodies against the
whole virion of TSWV-BL. Moreover, this isolate caused systemic
mottle and crinkle on the leaves of infected tobacco and N. benthamiana,
but did not infect squash or cucumbers, indicating that it is a distinct
isolate from the cucurbit isolate. These results indicate that this
isolate may be considered to be a third serogroup.

Seedlings derived from seven R0 lines were germinated on
kanamycin medium and inoculated with the above TSWV isolates.
Infectivity data were recorded daily starting seven days after
inoculation. Plants inoculated with TSWV-BL, Arkansas, 10W pakchoy
or Brazil isolates were rated susceptible if virus symptoms were
observed on uninoculated leaves. Plants inoculated with the Begonia
isolate were rated susceptible if local lesions were observed on
inoculated leaves, since this isolate does not cause systemic infection
in tobacco. All inoculated control NP(-) R1 plants were susceptible to
infection by these five isolates. They were systemically infected 12
days after inoculation, except that transgenic R1 plants inoculated with
Begonia produced only local lesions on the inoculated leaves. However,
almost all NP(+) R1 plants were highly resistant to the homologous
isolate TSWV-BL, while much lower percentages of NP(+) R1 plants
were resistant to heterologous isolates Arkansas, 10W pakchoy and
Begonia. On the other hand, all NP(+) R1 plants from the seven
transgenic lines were susceptible to the Brazil isolate, even though a
slight delay (1 to 2 days) in symptom expression was observed in some
of the high NP-expressing NP(+) R1 plants from line NP(+)4.

Resistant R1 plants remained symptomless throughout their life
cycles. The inoculated leaves of seventeen symptomless NP(+) plants
were checked for the presence of the virus by back inoculation on
leaves of Chenopodium quinoa plants. No virus was recovered from the
inoculated leaves of symptomless NP(+) plants, suggesting that the
virus could not replicate or spread in these NP(+) plants.

The relationship between the level of NP accumulation in
transgenic plants and the degree of resistance to heterologous TSWV
isolates was also studied. Analysis of the data described above
suggested that R1 plants derived from R0 lines with low levels of NP
offered the best resistance to the heterologous isolates of the "L"
serogroup (Arkansas and 10W pakchoy) while R1 plants from an R0 line
with a high level of NP were resistant to the Begonia isolate, which
belongs to the "I" serogroup. For example, an average 76% of inoculated
R1 plants from low NP expressing lines NP(+)2, 14, and 21 were resistant
to infections by the Arkansas and 10W pakchoy isolates, while resistance
to these isolates was observed in only 11% of similarly inoculated
plants from high NP expressing lines NP(+)4, 9, and 23. On the other
hand, the Begonia isolate infected 79% of R1 plants from the low NP
expressing lines NP(+)2, 14, and 21 but only 19% from high NP expressing
line NP(+)4.
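
The pooled percentages quoted for the Arkansas and 10W pakchoy isolates, and the 19% figure for line NP(+)4 with Begonia, can be recomputed from the infected/inoculated counts in the R1 table. The following is a minimal sketch, not the authors' calculation: the helper names are hypothetical, and for rows with blank cells the single printed heterologous "L" serogroup value is assumed to belong to the 10W pakchoy column (the pooled totals are the same whichever of the two columns it belongs to).

```python
# Infected/inoculated counts as printed in the R1 table.
R1_COUNTS = {
    "NP(+)2": {"Arkansas": (4, 25), "10W pakchoy": (3, 24)},
    "NP(+)14": {"10W pakchoy": (9, 20)},
    "NP(+)21": {"Arkansas": (5, 15), "10W pakchoy": (3, 15)},
    "NP(+)4": {"Arkansas": (21, 23), "10W pakchoy": (18, 21),
               "Begonia": (9, 48)},
    "NP(+)9": {"10W pakchoy": (20, 20)},
    "NP(+)23": {"10W pakchoy": (16, 20)},
}


def pooled_counts(lines, isolates):
    """Sum infected and inoculated plants over the given lines and isolates."""
    infected = 0
    inoculated = 0
    for line in lines:
        for isolate in isolates:
            if isolate in R1_COUNTS[line]:
                i, n = R1_COUNTS[line][isolate]
                infected += i
                inoculated += n
    return infected, inoculated


def pct(part, whole):
    """Whole-number percentage."""
    return round(100 * part / whole)


if __name__ == "__main__":
    l_isolates = ["Arkansas", "10W pakchoy"]
    low_i, low_n = pooled_counts(["NP(+)2", "NP(+)14", "NP(+)21"], l_isolates)
    high_i, high_n = pooled_counts(["NP(+)4", "NP(+)9", "NP(+)23"], l_isolates)
    print("low-NP lines resistant:", pct(low_n - low_i, low_n), "%")
    print("high-NP lines resistant:", pct(high_n - high_i, high_n), "%")
    beg_i, beg_n = pooled_counts(["NP(+)4"], ["Begonia"])
    print("NP(+)4 infected by Begonia:", pct(beg_i, beg_n), "%")
```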

Therefore, it was concluded that the transgenic R1 plants
expressing low levels of the NP gene were highly resistant to infection
with the isolate 10W pakchoy (the "L" serogroup), but not to the Begonia
isolate (the "I" serogroup). In contrast, the highly NP-expressing R1
plants were very resistant to infection by the Begonia isolate but not to
infection by the isolate from 10W pakchoy.

Thus, it was of interest to accurately quantitate the relation of
NP expression in individual plants with resistance to the heterologous
isolates. In a number of inoculation experiments reported herein, leaf
samples of transgenic plants were taken before inoculating with the
Arkansas and 10W pakchoy isolates. Samples were also taken from
non-inoculated leaves of plants inoculated with the Begonia isolate
after observations of the apparent relation between NP expression
levels and resistance were made. The latter method of sampling could
be done without interference from infection by the Begonia isolate
because this isolate neither causes systemic infection in tobacco nor
reacts with antibodies to the TSWV-BL NP. All samples were assayed
for relative NP levels by DAS-ELISA using antibodies raised to isolated
NP of TSWV-BL. Figures 5 and 6 show the relation between NP levels in
transgenic R1 plants (irrespective of the R0 lines they came from) and
their resistance to the Arkansas and 10W pakchoy isolates or to the
Begonia isolate. Nearly all transgenic R1 plants with very low or
undetectable ELISA reactions (0-0.05 OD405nm) were resistant to
infections by the Arkansas and 10W pakchoy isolates (the "L" serogroup)
but susceptible to the Begonia isolate (the "I" serogroup). In contrast,
almost all R1 plants that gave high ELISA reactions (0.4-1.0 OD405nm)
were resistant to the Begonia isolate but susceptible to the Arkansas
and 10W pakchoy isolates.
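
The two ELISA ranges just described amount to a simple lookup from OD405nm reading to the observed tendency. The sketch below is only an illustration: the function name is hypothetical, the text says "nearly all" and "almost all" plants followed the pattern (so this is a tendency, not a rule), and readings between the two ranges are not characterized in the text and are therefore returned as None.

```python
def expected_reaction(od405):
    """Map a DAS-ELISA reading to the tendency reported for R1 plants.

    Returns a dict with the expected reaction to the heterologous "L"
    serogroup isolates (Arkansas, 10W pakchoy) and to the Begonia isolate,
    or None when the reading falls outside the two ranges given in the text.
    """
    if od405 < 0:
        raise ValueError("OD reading cannot be negative")
    if od405 <= 0.05:
        return {"L serogroup isolates": "resistant", "Begonia": "susceptible"}
    if 0.4 <= od405 <= 1.0:
        return {"L serogroup isolates": "susceptible", "Begonia": "resistant"}
    return None  # intermediate readings are not characterized in the text


if __name__ == "__main__":
    for reading in (0.015, 0.2, 0.45):
        print(reading, expected_reaction(reading))
```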

The double-stranded (ds) RNA was isolated from the N.
benthamiana plants infected with TSWV-B using a combination of
methods [see Acta Horticulturae 186:51 (1986), and Can. Plant Dis. Surv.
68:93 (1988)] which have been successfully used for isolation of dsRNA
from tissue infected with grapevine leafroll virus. The dsRNA was
chosen for the cDNA synthesis since isolation of the virus particle from
this isolate has not been possible [see Plant Disease 74:154 (1990)]. In
order to make a cDNA library specific to the S RNA of TSWV-B, the
double stranded S RNA was gel-purified, denatured by methyl-mercury
treatment, and subjected to the cDNA synthesis procedure provided by
Promega using random primers. The synthesized cDNA fragments were
cloned via an EcoRI adaptor into the EcoRI-digested λ ZAPII
(Stratagene), and positive clones were identified by colony
hybridization using the cDNA probes prepared by reverse transcription
of gel-purified S RNA. Dozens of positive clones were analyzed on
agarose gels and only three overlapping clones containing the largest
inserts (L1, L22 and L30) were selected (see figure 3), covering nearly
the entire TSWV-B S RNA.

The nucleotide sequences of the inserts in clones L1, L22 and L30
were determined from both DNA strands, first by the universal and
reverse primers and then by the internal primers designed for
sequencing the S RNA of TSWV-B. Sequencing was done using the
Sanger dideoxyribonucleotide method, T7 polymerase (U.S.
Biochemicals, Sequenase™), and the double-stranded sequencing
procedure described by Siemieniak [see Analyt. Biochem. 192:441
(1991)]. The sequence analyses of these clones revealed inserts of
1.994 kb, 2.368 kb and 1.576 kb, respectively, and these sequences
represented 93% of the S RNA genome (see figure 3). The assembled
sequence was analyzed by comparisons with sequences of TSWV
isolates CPNH1, L3, I, and BL using computer programs available from
the Genetics Computer Group (GCG, Madison, WI).

Computer analysis showed that the assembled sequence of 2.842
kb covered the complete 52 K nonstructural protein gene, the complete
intergenic region (629 bp), and 737 bp of the NP gene (only 39
N-terminal nucleotides of the N gene were not represented). In order to
obtain this missing region of the N gene, a primer
TTCTGGTCTTCTTCAAACTCA, identical to a sequence 62 nucleotides
from the initiation codon of the N gene, was end-labeled with
polynucleotide kinase to screen the cDNA library described above. Five
putative clones were obtained. Sequence analysis of the five clones
showed that only clones S6 and S7 contain these 39 missing nucleotides
of the N gene. The latter clone also included the extreme 3' end of the
S RNA.
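
The position of the screening primer within the TSWV-B S RNA can be checked against the sequence listing with a plain substring search. This is a hedged sketch: the helper name is hypothetical, and the fragment is the stretch printed for positions 2811-2850 of the listing, assuming ten nucleotides per printed block.

```python
PRIMER = "TTCTGGTCTTCTTCAAACTCA"

# Four ten-nucleotide blocks as printed for positions 2811-2850 of the
# TSWV-B S RNA listing (assumed numbering: ten nucleotides per block).
FRAGMENT_START = 2811
FRAGMENT = "CCTGGTTCTG" "GTCTTCTTCA" "AACTCAACAT" "CTGCAGATTG"


def locate(primer, fragment, fragment_start):
    """Return the 1-based (first, last) positions of primer, or None."""
    index = fragment.find(primer)
    if index < 0:
        return None
    first = fragment_start + index
    return first, first + len(primer) - 1


if __name__ == "__main__":
    print(locate(PRIMER, FRAGMENT, FRAGMENT_START))
```

Because the primer matches the printed strand directly, it is identical to, rather than complementary to, this stretch of the listing, in line with the wording of the text.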

The 5' extreme end of the S RNA was obtained using the 5' RACE
System (GIBCO). Both ssRNA of TSWV-B and total RNAs isolated from
tobacco plants infected with TSWV-B were used to synthesize first
strand cDNA with an oligonucleotide (5'-CTGTAGCCATGAGCAAAG)
complementary to the nucleotide positions 746-763 of the TSWV-B S
RNA. The 3'-end of the first strand cDNA was tailed with dCTP using
terminal deoxynucleotidyl transferase. Tailed cDNA was then amplified
by PCR using an anchor primer that anneals to the homopolymeric tail,
and an oligonucleotide (5'-TTATATCTTCTTCTTGGA) that anneals to the
nucleotide positions 512-529 of the TSWV-B S RNA. The PCR-amplified
fragment was gel-purified and directly cloned into the T-vector
pT7Blue (Novagen) for sequence analysis. Eight independent
clones were sequenced with an oligomer primer
(5'-GTTCTGAGATTTGCTAGT) close to the 5' region of the S RNA (nucleotide
positions 40-57 of the TSWV-B S RNA). Six of the resulting clones
contained the 5' extreme end of the S RNA and the 5'-terminal
nucleotide sequence from these clones was identical. Thus, the
complete nucleotide sequence of the TSWV-B S RNA is 3049 nucleotides
in length.
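
A primer that anneals to the S RNA at given positions should equal the reverse complement of that stretch. The sketch below checks this for the sequencing primer and positions 40-57. It is illustrative only: the function names are hypothetical, and the fragment for positions 31-60 is read from the sequence listing with stray punctuation removed, which is an assumption about the intended bases.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def site(fragment, fragment_start, first, last):
    """Slice 1-based positions first..last out of a numbered fragment."""
    return fragment[first - fragment_start:last - fragment_start + 1]


# Positions 31-60 of the TSWV-B S RNA as read from the listing
# (stray punctuation in the printed blocks removed; an assumption).
FRAGMENT_START = 31
FRAGMENT = "CGAACCTCAA" "CTAGCAAATC" "TCAGAACTGT"

SEQUENCING_PRIMER = "GTTCTGAGATTTGCTAGT"   # stated to anneal at 40-57
RACE_PRIMERS = {
    "CTGTAGCCATGAGCAAAG": (746, 763),
    "TTATATCTTCTTCTTGGA": (512, 529),
}

if __name__ == "__main__":
    target = site(FRAGMENT, FRAGMENT_START, 40, 57)
    print(target, reverse_complement(target) == SEQUENCING_PRIMER)
    # Each primer length should match the span of positions it covers.
    for primer, (first, last) in RACE_PRIMERS.items():
        print(primer, len(primer) == last - first + 1)
```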

Thus these two clones together with the three clones previously
sequenced (L1, L22, L30, S6 and S7) covered a total of 3032 nucleotides
depicted above. Comparisons with the terminal sequences of TSWV-CPNH1
and TSWV-I revealed that although the extreme 5' end of 18
nucleotides was not represented in the assembled sequence, the
extreme 3'-terminus of the TSWV-B S RNA is identical to the extreme
3' end of the TSWV-I S RNA and is only one out of fifteen nucleotides
different from the extreme 3' end of TSWV-CPNH1. The conservation of
the terminal sequence among TSWV isolates is consistent with
observations of the other members of Bunyaviridae genera, and supports
the hypothesis that the terminal sequences might form a stable
base-paired structure, which could be involved in its replication and
encapsulation.

The complete nucleotide sequence of the S RNA genome of TSWV-B
(the Brazilian isolate discussed above) according to the present
invention is:

AGAGCAATTG GGTCAX1TJ.T TATTCTAAAT CGAACCTCAA CTAGCAAATC 50 

TCAGAACTGT AATAAGCACA AGAGCACAAG AGOCAC AATG TCATCAGGTG 100

TTI^TGAATC GftTCATTC&G ACAAAGGCTT CAGTITGGGG ATCGAGAGCA 150 
TCTGGIAAGT CCAICGTGGft TTCTmCTCG ATTTATGAGT TTCCAACTGG 200 
rrCTQCACTG GTTCAAACTC AGTTGTACTC TSftTTOGAGG AG CA AAAGI A 250 
GCITGGGCTA CACTTCAAAA. ATTGCTGATA TTOCTGCTCT AGAGGAGGAA 300 
ATTTTATCTC ACtAACGTTCA TATCCCACTTG TTTGATGATA TTGATTTCAG 350
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CaiCaaMC AATGATTCTT MM TTr T grrrcr ir.CAAr.ArAn 400 

TIMCACCAA, TGGAGTGAAG CATCAGGCTr attttaaact TT-mrerTT 450 

GgXAAlTCC ATQCtTITGA AC *7rGTGATG AGCAGGTC AG arattcttah 500 

CAgflTOCGG craaAGA AG AAGAT& t aat tcctgatcac. aaatatata T 550 

CTCC I ^PA CflAJjGQAT CT (TICTGCTOTG TCAAA GAACA TAfTTTAPAAA 600 

GJQGAAAlfiA GOACMTQA ryyrrrmrrr. aaactqaatc TTY-rrrrrrr 650 

TftACftGAftAT GTICAJGAGT GGCTCTATAft mrAAACCA AAmr^Arr 7 00 

AGAKXtAAAG iMTAAGAGA ACTTTTAAATT rTrrr n rAGT nwm-rrrp: 750 

CICATOGCTA CAGftAAAGAA CAlTATCTrT AArrr rcAAG (TrmYTTTAA 800 

AGCm^M' ffltlUKATT TOAGitgag rrrr rnGcrc AGAArrrraA 850 

AAtoll'l'lViftA GCAftATAGQC ATACAGAAfy TTrTCAACTT TCTAf^AnAP 900 

GMCCGGEA AAAGTOCm TTTGTCTATT GCAT GeATGC. rAAATTAPAA 9 50 

CAGraflGGM ACAGCTT lAA ATTSTCACTCT tatatctaga CATranrrTr 1000 

C^TCCCTO GTCCAAAGCT GCTTITCAAT TATTAATCAT TOCTCCGAT 1050 

CTGAAAGAGC CmCAAGAC TgrBTATCAT rmTATATC CTrAAAGGAT H00 

ImtA-rgCT tTGCri fflGA (TTrAC A CTTC CTTTGr ACAA GTTYTTrrcrA 1150 

AeAASCTSCA AGAAGAiGTC Atcawata rTA TAAAPAQ crrrGAArTA 1200 

ACO CCAGCTA AGCTGGATCT AQGTGAAAGA ACCITCAACT ACAGTGAAGA 1250 

TOTICGAag AAGAAGTATT ttctttpaaa aacactcgaa TnrrrrcyAG 1300 

XAAAIGTGGA GACEATGTCT TATITCGATA nrATCCAGAT TTmrATO 1350 

AflGAlAGACT TTGCCAGAGG AGAGATTAGA ATCTCCCCTT. AATTTArTTr 1400 

TfiJTQCAAGA TCT I TGCTGA AGCTGGATIT GAGT AAGATT. aaggaaaaga 1450 

AGTCCttGAC TrGGGAAACA TCCAGCTATG ATr-TAG^TA aaa^^st^ i 50 o 

AmcmcrcT aagtagtAtt tgtcaacttg CTrATCcnr atgttgttea 1550 

TTTCTITEAA AlCTAAAGTA AGTTAGAITC AAGTAGTITA GTATGCTATA 1600 

GCAnATTAC AAAAAAIACA AAAAAAIACA AAAAAAIACA AAAAAIAIAA 1650 

AAAACCCAAA AAGATCCCAA AAGGGACGAT TTGGITGATT TACTCTGTTT 1700 

TAGGCTTAIC TAAGCTGCTT TTGTTTGAGC AAAATAACAT TGTAACATGC 1750 

AAIAACTGGA AITIAAAGTC CmAAAGAAG TTTCAAAGGA CAGCTEAGCC 1800 

AAAATTGGTT TITGTITITG TTTTTTTGTT TTITGTTnT TIGTTTTAIT 1850 

TrrATiTTTA GTTEATriTT TGTTTITGTT ATnTEATIT ITAITITATT 1900 

niTiTiftrr ttattiaiat aiatatcam cacaatccac acaaataait 1950 

TEAAJTICAA ACATTCTACT GATTTAACAC ACTIAGCCTG ACTTEATCAC 2000 

ACTTAACACG CTTAGTIAGG CTTTAACACA CTGAACTGAA TTAAAACACA 2050 

CTEAGTAITA TGCATCTCTT AATTAACACA CTTEAAIAAT ATGCATCTCT 2100 
GAATCAGCCT TAAAGAAGCT TTTATGCAAC ACCAGCAATC TTGGCCTCTT 2150

TCTTAACTCC AAACATTTCA TAGAATTTGT CAAGAITATC ACTGTAATAG 2200 

TOCATAGCAA TQCTTCOCTT AGCAITGQGA. TTGCAAGAAC TAAGTATCTT 2250 

GGCATATTCT TTCCCTTTGT TTATCTGTGC ATCATCCATT GTAAATCCTT 2300 

TGcrrrmAG cactgtqcaa accttccoca gagcttcctt agtgttgtac 2350

TmGTTGGTT CAATCCCTAA CDCCTTGTAC TTTGCATCTT GATATATGGC 2400 

AAGAACAACA CTGATCATCT CGAAGCTGTC AACAGAAGCA ATGAGAGGGA 2450 

TACTACCTCC AAGCATTATA GGAAGTCTCA CAGATTTTGC ATCTGCCAGA 2500 

GQGAGCOOGT AAGCTTGGAC CAAAGGGTGG GAGGCAATTT TIGCTTTGAT 2550 

AATAGCAAGA TTCTCATTGT TTGCAGTCTC TTCTATGAGC T1CACTCTEA 2600

TCATGCTATC AAGCCTCCTG AAAGTCATAT CCTTAGCTCC AACTCTTTCA 2650 

GAATTTfrCT TTATCGTGAC CTTACCAAAA GTAAAATCAC TTTGGTTCAC 2700 

AAGTTTGATA ATGCCTTGGC GATTCTTCAA. GAAAGTCAAA CATGAAGTGA 2750 

TACTCAmT CTTAATCAGG TCAAGATTTT CCTGACAGAA AGTCTTAAAG 2800 

TTGAATQCGA CCTGGTTCTG GTCTTCTTCA AACTCAACAT CTGCAGATTG 2850

AGTTAAAAGA GAGACAATGT TTTCTTTTGT GAGCTTGACC TIAGACATGG 2900 

TGGCAGTTTA GATCIAGACC TTTCTCGAGA GATAAGATTC AAGGTGAGAA 2950 

AGTGCAACAC TGTAGACCGC GGTCGTTACT TATCCTGTTA ATGTGATGAT 3000 

TTGTAITGCT GAGTATTAGG TTTTTGAATA AAATTGACAC AATTGCTCT 3049 

The deduced amino acid sequences of the nonstructural (single
underlined above) and nucleocapsid proteins according to the present
invention are:
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Trp 


Ile Tyr Glu Phe Pro Gly Ser Pro Leu Val Thr Gln
45

Leu Tyr Ser Asp Ser Arg Ser Lys Ser Ser Phe Gly Tyr Thr Ser

Lys Ile Gly Asp Ile Pro Ala Val Glu Glu Glu Ile Ser Gln
65 70

Asn Val His Ile Pro Val Phe Asp Asp Ile Asp Phe Ser Ile Asn

Ile Asn Asp Ser Phe Leu Ala Ile Ser Val Cys Ser Asn Thr Val
100 105

Asn Thr Asn Gly Val Lys His Gln Gly His Leu Lys Val Leu Ser

Leu Ala Gln Leu His Pro Phe Glu Pro Val Met Ser Arg Ser Glu
125 130 135

Ile Ala Ser Arg Phe Arg Leu Gln Glu Glu Asp Ile Ile Pro Asp
140 145 150

Asp Lys Tyr Ile Ser Ala Ala Asn Lys Gly Ser Leu Ser Cys Val
155 160 165

Lys Glu His Thr Tyr Lys Val Glu Met Ser His Asn Gln Ala Leu
170 175 180

Gly Lys Val Asn Val Leu Ser Pro Asn Arg Asn Val His Glu Trp
185 190 195

Leu Tyr Ser Phe Lys Pro Asn Glu Asn Gln Ile Glu Ser Asn Asa
200 205 210

Am Thr Val Asn Ser Leu Ala Val Lys Ser Leu Leu Met Ala Thr
215 220 225

Glu Asn Asn Ile Met Pro Asn Ser Gln Ala Phe Val Lys Ala Ser
230 235 240

Thr Asp Ser His Phe Lys Leu Ser Leu Gln Leu Arg Ile Pro Lys
245 250 255

Val Leu Lys Gln Ile Ala Ile Gln Lys Leu Phe Lys Phe Ala Gly
260 265 270

Asp Glu Thr Gly Lys Ser Phe Tyr Leu Ser Ile Ala Cys Ile Pro
275 280 285

Asn His Asn Ser Val Glu Thr Ala Leu Asn Val Thr Val Ile Cys
290 295 300

Arg His Gln Leu Pro Ile Pro Lys Ser Lys Ala Pro Phe Glu Leu
305 310 315

Ser Met Ile Phe Ser Asp Leu Lys Glu Pro Tyr Asn Thr Val His
320 325 330

Asp Pro Ser Tyr Pro Gln Arg Ile Val His Ala Leu Leu Glu Thr
335 340 345

His Thr Ser Phe Ala Gln Val Leu Cys Asn Lys Leu Gln Glu Asp
350 355 360

Val Ile Ile Tyr Thr Ile Asn Ser Pro Glu Leu Thr Pro Ala Lys
365 370 375

Leu Asp Leu Gly Glu Arg Thr Leu Asn Tyr Ser Glu Asp Ala Ser
380 385 390

Lys Lys Lys Tyr Phe Leu Ser Lys Thr Leu Glu Cys Leu Pro Val
395 400 405

Asn Val Gln Thr Met Ser Tyr Leu Asp Ser Ile Gln Ile Pro Ser
410 415 420

Trp Lys Ile Asp Phe Ala Arg Gly Glu Ile Arg Ile Ser Pro Gln
425 430 435

Ser Thr Pro Ile Ala Arg Ser Leu Leu Lys Leu Asp Leu Ser Lys
440 445 450

Ile Lys Glu Lys Lys Ser Leu Thr Trp Glu Thr Ser Ser Tyr Asp
455 460 465

Leu Glu;
and

Met Ser Lys Val Lys Leu Thr Lys Glu Asn Ile Val Ser Leu Leu
5 10 15

Thr Gln Ser Ala Asp Val Glu Phe Glu Glu Asp Gln Asn Gln Val
20 25 30

Ala Phe Asn Phe Lys Thr Phe Cys Gln Glu Asn Leu Asp Leu Ile
35 40 45

Lys Lys Met Ser Ile Thr Ser Cys Leu Thr Phe Leu Lys Asn Arg
50 55 60

Gln Gly Ile Met Lys Val Val Asn Gln Ser Asp Phe Thr Phe Gly
65 70 75

Lys Val Thr Ile Lys Lys Asn Ser Glu Arg Val Gly Ala Lys Asp
80 85 90

Met Thr Phe Arg Arg Leu Asp Ser Met Ile Arg Val Lys Leu Ile
95 100 105

Glu Glu Thr Ala Asn Asn Glu Asn Leu Ala Ile Ile Lys Ala Lys
110 115 120

Ile Ala Ser His Pro Leu Val Gln Ala Tyr Gly Leu Pro Leu Ala
125 130 135

Asp Ala Lys Ser Val Arg Leu Ala Ile Met Leu Gly Gly Ser Ile
140 145 150

Pro Leu Ile Ala Ser Val Asp Ser Phe Glu Met Ile Ser Val Val
155 160 165

Leu Ala Ile Tyr Gln Asp Ala Lys Tyr Lys Glu Leu Gly Ile Glu
170 175 180

Pro Thr Lys Tyr Asn Thr Lys Glu Ala Leu Gly Lys Val Cys Thr
185 190 195

Val Leu Lys Ser Lys Gly Phe Thr Met Asp Asp Ala Gln Ile Asn
200 205 210

Lys Gly Lys Glu Tyr Ala Lys Ile Leu Ser Ser Cys Asn Pro Asn
215 220 225

Ala Lys Gly Ser Ile Ala Met Asp Tyr Tyr Ser Asp Asn Leu Asp
230 235 240

Lys Phe Tyr Glu Met Phe Gly Val Lys Lys Glu Ala Lys Ile Ala
245 250 255

Gly Val Ala

As the nucleocapsid protein gene depicted above is on the viral
complementary strand, the nucleocapsid protein gene of TSWV-B is:

ATG TCT AAG GTC AAG CTC ACA AAA GAA AAC ATT GTC TCT CTT TTA 45
ACT CAA TCT GCA GAT GTT GAG TTT GAA GAA GAC CAG AAC CAG GTC 90
GCA TTC AAC TTT AAG ACT TTC TGT CAG GAA AAT CTT GAC CTG ATT 135
AAG AAA ATG AGT ATC ACT TCA TGT TTG ACT TTC TTG AAG AAT CGC 180
CAA GGC ATT ATG AAA GTT GTG AAC CAA AGT GAT TTT ACT TTT GGT 225
AAG GTC ACG ATA AAG AAA AAT TCT GAA AGA GTT GGA GCT AAG GAT 270
ATG ACT TTC AGG AGG CTT GAT AGC ATG ATA AGA GTG AAG CTC ATA 315
GAA GAG ACT GCA AAC AAT GAG AAT CTT GCT ATT ATC AAA GCA AAA 360
ATT GCC TCC CAC CCT TTG GTC CAA GCT TAC GGG CTG CCT CTG GCA 405
GAT GCA AAA TCT GTG AGA CTT GCT ATA ATG CTT GGA GGT AGT ATC 450
CCT CTC ATT GCT TCT GTT GAC AGC TTC GAG ATG ATC AGT GTT GTT 495
CTT GCC ATA TAT CAA GAT GCA AAG TAC AAG GAG TTA GGG ATT GAA 540
CCA ACT AAG TAC AAC ACT AAG GAA GCT CTG GGG AAG GTT TGC ACA 585
GTG CTT AAA AGC AAA GGA TTT ACA ATG GAT GAT GCA CAG ATA AAC 630
AAA GGG AAA GAA TAT GCC AAG ATA CTT AGT TCT TGC AAT CCC AAT 675
GCT AAG GGA AGC ATT GCT ATG GAC TAT TAC AGT GAT AAT CTT GAC 720
AAA TTC TAT GAA ATG TTT GGA GTT AAG AAA GAG GCC AAG ATT GCT 765
GGT GTT GCA TAA 777
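
The correspondence between the gene listing and the deduced nucleocapsid
protein can be checked by translating the codons with the standard
genetic code. The following is a minimal sketch and not part of the
described procedures: the codon rows are transcribed from the listing
above with apparent scanning errors corrected, and the names
CODON_TABLE, N_GENE and translate are illustrative only.

```python
# Standard genetic code in TCAG order (a common compact encoding).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# N gene of TSWV-B as transcribed from the listing (assumed transcription).
N_GENE = "".join("""
ATG TCT AAG GTC AAG CTC ACA AAA GAA AAC ATT GTC TCT CTT TTA
ACT CAA TCT GCA GAT GTT GAG TTT GAA GAA GAC CAG AAC CAG GTC
GCA TTC AAC TTT AAG ACT TTC TGT CAG GAA AAT CTT GAC CTG ATT
AAG AAA ATG AGT ATC ACT TCA TGT TTG ACT TTC TTG AAG AAT CGC
CAA GGC ATT ATG AAA GTT GTG AAC CAA AGT GAT TTT ACT TTT GGT
AAG GTC ACG ATA AAG AAA AAT TCT GAA AGA GTT GGA GCT AAG GAT
ATG ACT TTC AGG AGG CTT GAT AGC ATG ATA AGA GTG AAG CTC ATA
GAA GAG ACT GCA AAC AAT GAG AAT CTT GCT ATT ATC AAA GCA AAA
ATT GCC TCC CAC CCT TTG GTC CAA GCT TAC GGG CTG CCT CTG GCA
GAT GCA AAA TCT GTG AGA CTT GCT ATA ATG CTT GGA GGT AGT ATC
CCT CTC ATT GCT TCT GTT GAC AGC TTC GAG ATG ATC AGT GTT GTT
CTT GCC ATA TAT CAA GAT GCA AAG TAC AAG GAG TTA GGG ATT GAA
CCA ACT AAG TAC AAC ACT AAG GAA GCT CTG GGG AAG GTT TGC ACA
GTG CTT AAA AGC AAA GGA TTT ACA ATG GAT GAT GCA CAG ATA AAC
AAA GGG AAA GAA TAT GCC AAG ATA CTT AGT TCT TGC AAT CCC AAT
GCT AAG GGA AGC ATT GCT ATG GAC TAT TAC AGT GAT AAT CTT GAC
AAA TTC TAT GAA ATG TTT GGA GTT AAG AAA GAG GCC AAG ATT GCT
GGT GTT GCA TAA
""".split())


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


protein = translate(N_GENE)
print(len(N_GENE), "nucleotides;", len(protein), "amino acids")
print(protein)
```

The one-letter output can be compared residue by residue with the
three-letter listing of the nucleocapsid protein given above.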

The complete S RNA of TSWV-B should be 3049 nucleotides in
length, 134 nucleotides longer than the S RNA of TSWV-CPNH1. This
difference was mainly attributed to the elongated intergenic region of
the TSWV-B S RNA. Analysis of the sequenced region of TSWV-B S RNA
revealed two open reading frames as depicted above, which is similar
to other TSWV isolates. The larger one was localized on the viral RNA
strand, originating at nucleotide 88 and terminating at nucleotide 1491.
The smaller one, on the viral complementary strand, was defined by an
initiation codon at nucleotide 2898 and a termination codon at
nucleotide 2122. The two open reading frames were separated by an
intergenic region of 629 nucleotides. Comparisons of the entire
sequenced TSWV-B S RNA with S RNA regions of other isolates are shown
in the following table, which depicts the percent homology comparison of
aligned nucleotide and amino acid sequences of the TSWV-B S RNA with
those of the other isolates:

Comparison a  Overall   52K protein ORF        Intergenic  N protein ORF
              nt        nt     aa              nt          nt     aa
B/CPNH1       76.4 b    80.0   86.1(78.3) c    72.4        77.5   91.5(79.1)
B/L3          75.8      79.0   89.0(82.0)      76.4        78.0   91.1(79.9)
B/BL          76.3      -      -               72.8        77.6   90.3(79.5)
B/I           63.0      -      -               -           63.1   69.7(55.3)
CPNH1/L3      94.8      95.6   92.0(89.4)      89.2        96.8   99.6(98.5)
CPNH1/BL      96.4      -      -               95.9        97.2   98.8(96.9)
CPNH1/I       62.7      -      -               -           60.8   69.5(55.1)
L3/BL         95.1      -      -               92.6        97.3   99.2(98.5)
L3/I          60.9      -      -               -           60.9   69.5(55.1)
I/BL          61.7      -      -               -           60.9   68.8(53.9)

a The partial or complete S RNA sequences of isolates TSWV-CPNH1 (2.916
kb), TSWV-L3 (2.837 kb), TSWV-BL (2.037 kb) and TSWV-I (1.144 kb) were
used for comparisons with the S RNA sequence of TSWV-B (3.049 kb).

b Percent similarities were calculated by comparison of their nucleotide or
predicted amino acid sequences using the program BESTFIT of the GCG Sequence
Analysis Software Package.

c Percent identity is in parentheses.
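
For orientation, percent identity between two sequences that have
already been aligned can be computed as sketched below. This is an
illustrative calculation only and is not the BESTFIT algorithm, which
builds its own local alignment and also reports a similarity score that
credits conservative substitutions. The function name
percent_identity, the gap character and the two short example strings
are assumptions made for this sketch.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent of aligned, non-gap columns in which both sequences agree."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # columns containing a gap are not scored
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / compared


# Hypothetical ten-column alignment with two mismatches.
print(percent_identity("ATGTCTAAGG", "ATGTCAAAAG"))
```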

As depicted, the greatest nucleotide sequence similarity
(75.8%-76.4%) was shown with the L-type isolates (CPNH1, L3 and BL). To
a lesser extent, there was nucleotide sequence similarity (63%) between
the TSWV-B S RNA and the S RNA of TSWV-I, assigned to the I serogroup.
For comparison, the sequenced S RNA regions of the L-type isolates
(CPNH1, L3 and BL) shared 94.8%-96.4% nucleotide sequence
similarity.

The open reading frame of 777 nucleotides encodes the N protein
of 258 amino acids with a predicted molecular weight of 28700 Da. The
sequence comparisons of the N open reading frame from TSWV isolates
revealed that nucleotide sequences of the N genes from the isolates
CPNH1, L3 and BL differ from TSWV-B by a considerably larger amount
(22%-22.5%) than they differ from each other (2.7%-3.2%). Consistent
with the results of the immunological analysis, the N amino acid
sequences among the CPNH1, L3 and BL isolates are more closely related
to each other (98.8%-99.6% similarity or 96.9%-98.5% identity) than to
TSWV-B (90.3%-91.5% similarity or 79.1%-79.9% identity).
Much lower homology was observed to TSWV-I at both nucleotide
(63.1%) and amino acid (69.7% similarity or 55.3% identity) levels.
Except for the N open reading frame of TSWV-I, which encodes 262 amino
acids, the N open reading frames of the other isolates code for 258
amino acids. Computer analysis suggested that the extra residues of
the TSWV-I N open reading frame resulted from amino acid sequence
insertions (residues 82 through 84 and residue 116). One potential
N-glycosylation site is found at residue 68.

The second open reading frame of 1404 nucleotides encodes the
nonstructural protein of 467 amino acids with a predicted molecular
weight of 52566 Da. Comparisons with homologous open reading frames
of TSWV-CPNH1 and TSWV-L3 showed 80% and 79% similarities at the
nucleotide level, and 86.1% (78.3% identity) and 89% (82.0%
identity) similarities at the amino acid level. This open reading frame
contains four potential glycosylation sites, which are located in
exactly the same positions as those of TSWV-CPNH1 and TSWV-L3.
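
The lengths quoted for the two open reading frames follow directly from
the nucleotide coordinates reported for the TSWV-B S RNA. A minimal
sketch of that arithmetic is given below; the function name orf_stats
is illustrative, and the calculation assumes that both end coordinates
are inclusive and that the final codon of each frame is the stop codon.

```python
def orf_stats(first_nt, last_nt):
    """Return (length in nucleotides, encoded amino acids) for an ORF.

    Coordinates are inclusive and may be given in either orientation,
    so a frame on the complementary strand can be passed as (2898, 2122).
    """
    length = abs(last_nt - first_nt) + 1
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    codons = length // 3
    return length, codons - 1  # the last codon is the stop codon


# Coordinates as reported in the description.
print("nonstructural ORF:", orf_stats(88, 1491))
print("nucleocapsid ORF: ", orf_stats(2898, 2122))
```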

The intergenic region of the TSWV-B S RNA was, due to several
insertions, 126 and 41 nucleotides longer than the counterparts of
TSWV-CPNH1 and TSWV-L3, respectively. Sequence analysis by the
program FOLD indicated that the intergenic region can form a very
complex and stable hairpin structure by internally base-pairing U-rich
stretches with A-rich stretches of the intergenic region; this
structure had similar stability to those produced from TSWV-CPNH1 and
TSWV-L3, as indicated by minimum free energy values. This internal
base-paired structure may act as a transcription termination signal.

The results tabulated above also revealed that the N protein of
TSWV-B is subject to a higher degree of selective pressure than the 52
K protein; the similarities among the amino acid sequences of the 52 K
protein are lower than those found for the amino acid sequences of the
NPs. Nucleotide sequence divergence is highest among the intergenic
regions, which indicates that this region is subject to less selective
pressure than either genetic region.

The evolutionary relationships among TSWV-B and the other four
TSWV isolates were analyzed and depicted in figure 4, in which the
evolutionary tree organization is consistent with the relatedness of
serological data collected for these TSWV isolates. Thus, TSWV-B,
according to the present invention, is more closely related to the
L-type isolates than to the I-type isolate TSWV-I, but is much less
similar to the L-type isolates than the L-type isolates are to each
other.

Despite a slight delay of symptom expression, transgenic plants
did not show resistance to the Brazil isolate of TSWV. Serological
results show that this isolate is distinct from the "L" and "I" type
isolates, and biologically different from the cucurbit isolate. The
Brazil isolate may thus belong to still another serogroup of TSWV. In
any event, infectivity results show that it is unlikely that a single NP
gene will provide resistance to all isolates in the Tospovirus genus.

Transgenic plants according to the present invention that gave
low or undetectable ELISA reactions (0-0.05 OD405nm) were resistant
to infection by the heterologous isolates (Arkansas and 10W pakchoy) of
the "L" serogroup, whereas no protection against these isolates was
found in plants accumulating high levels of the NP. Compared to the
ELISA readings of control NP(-) plants (0.05 OD405nm), these
transgenic plants may produce little, if any, TSWV-BL NP. Similar
results have been observed in transgenic plants in which CP
accumulation was not detected; these were highly resistant to virus
infection. The mechanism underlying this phenomenon is presently
unknown. It is likely that this type of resistance might be attributed to
interference of CP RNA molecules produced in transgenic plants with
viral replication, presumably by hybridizing to minus-sense replicating
RNA of the attacking virus, binding to essential host factors (e.g.,
replicase) or interfering with virion assembly.

It should be noted, however, that the resistance to the
homologous TSWV-BL isolate is apparently independent of the
expression levels of the NP gene. Although the relative NP levels of the
individual R1 plants inoculated with TSWV-BL were not measured, it is
reasonable to assume that the NP produced in these inoculated R1
plants (a total of 145 plants tested) ranged from undetectable to high.
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In contrast to the case for protection against the heterologous 
isolates of the "L" serogroup, protection against the Begonia isolate of
the TSWV-I serogroup was found in the high NP-expressing R1 plants.
Comparison of the NP nucleotide sequence of the "L" serogroup with that
of the "I" serogroup revealed 62% and 67% identity at the nucleotide and
amino acid levels, respectively. The difference between the NP genes of
the two serogroups might be so great that the NP (of the "L" serogroup)
produced in transgenic plants acted as a dysfunctional protein on the
attacking Begonia isolate of the "I" serogroup. Incorporation of this
"defective" coat protein into virions may generate defective virus that
inhibits virus movement or its further replication. This type of
interaction is expected to require high levels of the NP for protection.
Alternatively, resistance to the Begonia isolate may also involve
interference of NP transcripts produced in R1 plants with viral
replication. If this is true, more NP transcripts (due to the
heterologous nature of the two NP genes) may be required to inhibit
replication of a heterologous virus.

Although there are no obvious explanations for the results
showing the relation of NP levels in individual R1 plants to resistance
to the heterologous isolates of the "L" and "I" serogroups, it is believed
these are definite trends since the data were derived from a large
number (190) of plants. Thus, it is believed that a measurement of CP
or NP levels in individual plants may provide a more accurate way to
relate NP or CP levels to resistance. By this form of data analysis, the
results show that the resistance was more closely related to NP levels
in each test plant than to the NP level of the R0 line from which they
were derived. For the TSWV-BL NP gene in tobacco, at least, it appears
that integration sites of the NP gene in the plant chromosome may not be
important for viral resistance.

Studies have also been conducted to determine the reaction of
transgenic R1 and R2 tomatoes containing the nucleocapsid protein gene
of TSWV-BL according to the present invention to the following
isolates: Brazil (a distantly related virus), T91 (a closely related virus)
and BL (a homologous isolate). In these studies, transgenic tomatoes (L.
esculentum) were produced by A. tumefaciens-mediated gene transfer
of the nucleocapsid protein (N) gene of the lettuce isolate of tomato
spotted wilt virus BL into germinated cotyledons using modifications of
published procedures [see Plant Cell Reports 5:81 (1986)]. The tomato
line "Geneva 80" was selected for transformation because it contains
the Tm-2² gene, which imparts resistance to TMV, thus creating the
possibility of producing a multiple virus-resistant line.

Transformants were selected on kanamycin media and rooted
transgenic tomatoes were potted and transferred into the greenhouse.
R1 and R2 tomato seedlings expressed the NPT II gene, suggesting
multiple insertions of this gene in the plant genome. In contrast, only
18% of the seedlings produced detectable levels of the N protein.

Nine R1 and three R2 lines were tested for resistance to the
following three Tospoviruses, specifically TSWV-BL, TSWV-T91, and
TSWV-B. Infectivity was based upon visual inspection of test
plants. In those cases where plants appeared healthy except for a few
rust-colored rings or insect damage, extracts from these plants were
inoculated to N. benthamiana to test for the presence of the virus. As
depicted in the following table, nearly all control tomato plants
exhibited typical symptoms consisting of plant stunting, leaf yellow
mosaic and rugosity 3 to 4 weeks after inoculation with TSWV-BL,
TSWV-T91 or TSWV-B. However, only 4% of the R1 and R2 transgenic
plants became infected with TSWV-BL, 7% with TSWV-T91, and 45%
with TSWV-B.
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Viral resistance In transgenic R1 and R2 tomatoes expressing the nucleoprotein gene 
of the lettuce strain of tomato spotted wilt virus 
                     Inoculating isolates a
Plant line    TSWV-BL    TSWV-T91    TSWV-B

R1 plants:
T13-1         0/22       1/26        7/24
T13-2         6/20       NT b        NT
T13-3         2/42       0/20        12/18
T13-4         0/25       NT          NT
T13-9         0/20       NT          NT
T13-10        1/50       2/26        11/26
T13-11        0/22       NT          NT
T13-12        1/29       NT          NT
T13-13        0/22       NT          NT
TOTAL         10/252     3/72        30/68

R2 plants:
T13-1-7       0/8        2/8         5/8
T13-1-9       0/8        1/8         2/8
T13-1-11      0/8        1/9         5/9
TOTAL         0/24       4/25        12/25

CONTROLS      92/95      51/53       52/53

a Plants were inoculated at the one- to two-leaf stage with 5-, 10-, or
20-fold diluted leaf extract of N. benthamiana, H423 tobacco or tomato;
the same plants were re-inoculated 7 days later and symptoms were
recorded after another 14 days; the reaction is expressed as number of
plants with symptoms/number of plants tested.

b Not tested.
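
The infection rates of 4%, 7% and 45% quoted for the transgenic
tomatoes can be reproduced by pooling the R1 and R2 totals from the
table. The short sketch below shows the calculation; the dictionary
layout and the function name pooled_percent are illustrative choices
and do not appear in the original studies.

```python
# (plants with symptoms, plants tested) taken from the TOTAL rows of the table.
TOTALS = {
    "TSWV-BL":  {"R1": (10, 252), "R2": (0, 24)},
    "TSWV-T91": {"R1": (3, 72),   "R2": (4, 25)},
    "TSWV-B":   {"R1": (30, 68),  "R2": (12, 25)},
}
CONTROLS = {"TSWV-BL": (92, 95), "TSWV-T91": (51, 53), "TSWV-B": (52, 53)}


def pooled_percent(groups):
    """Percent infected after pooling several (infected, tested) pairs."""
    infected = sum(i for i, _ in groups)
    tested = sum(n for _, n in groups)
    return 100.0 * infected / tested


for isolate, generations in TOTALS.items():
    transgenic = pooled_percent(generations.values())
    control = pooled_percent([CONTROLS[isolate]])
    print(f"{isolate}: transgenic {transgenic:.1f}% infected, "
          f"controls {control:.1f}% infected")
```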

Accordingly, the description above supports the finding that
transgenic tomato plants that express the N gene of TSWV-BL show
resistance to infection by TSWV-BL, by other TSWV isolates that are
closely related to TSWV-BL, and by the more distantly related TSWV-B.

In further limited studies with an additional isolate, all
transgenic plants were resistant to the 10W (pakchoy) isolate, whereas
the controls were infected. These results show that transgenic
tomatoes are better protected against closely related isolates than
distantly related isolates. Unlike in transgenic tobacco and N.
benthamiana expressing the TSWV-BL N gene, the level of N protein
expression did not correlate with the observed protection in transgenic
tomatoes; 55% of the transgenic tomatoes were also resistant to the
distantly related isolate TSWV-B, which was not observed in
transgenic tobacco and N. benthamiana plants. These discrepancies may
reflect that tomato is inherently less susceptible to Tospoviruses.
In addition, studies were also conducted to determine virus
distribution in a small number of plants at 5 and 1 weeks after
inoculation. The distal halves from leaflets of all expanded leaves of
each plant were ground and back-inoculated onto N. benthamiana. The
results taken seven days after inoculation showed that virus cannot be
recovered from any leaf tissue of asymptomatic transgenic plants
inoculated with either TSWV-BL, -T91, or -B, confirming the visual
findings reported above. In transgenic plants showing symptoms, the
virus is not distributed throughout the plant. For example, a transgenic
plant which could not be conclusively rated visually contained the virus
in only two of the 8 leaves: the second leaves from the bottom and top
of the plant. Conversely, virus is present in all leaves of the infected
control plant, and is absent in those of the healthy control plants.

Graft inoculations were attempted to test whether the resistant
transgenic plants could become infected if virus is introduced into the
vascular system. R1 and R2 plants that had been inoculated at 1:5, 1:10
or 1:20 dilutions of TSWV-BL, -T91, or -B were grafted onto control
plants infected with the same isolates and dilutions. The 34 transgenic
plants were asymptomatic after 31 days, although the non-transgenic
controls were infected. After 23 days, the top 46 cm of transgenic
plants had been trimmed away to induce new growth and more plant
stress. Although the young, vigorously growing new shoots failed to
show any symptoms on the 31st day post inoculation, 33%, 31% and 45%
of the TSWV-BL, -T91 and -B plants, respectively, were showing leaf or
stem symptoms at 45 days post inoculation. These results indicate that
some transgenic plants are tolerant, and others are immune to
infection.

Thus, according to one aspect of the present invention, transgenic
plants expressing the NP gene of the TSWV-BL isolate are highly
resistant to infections of both the homologous TSWV-BL isolate and
heterologous isolates of the same serogroup (Arkansas and 10W
pakchoy). More significantly, the resistance is effective against the
Begonia isolate from another serogroup. In brief, the above clearly
describes that transgenic tobacco plants expressing the nucleoprotein
gene of TSWV-BL display resistance to both TSWV and INSV, and the
protection appears to be mediated by the nucleoprotein against distantly
related INSV and by the nucleoprotein gene ribonucleotide sequence
against the homologous and closely related TSWV isolates. This is the
first time broad spectrum resistance of the engineered plants to
different isolates of TSWV has been shown.

While coat protein protection generally displays delay and/or
reduction in infection and symptom expression, but no immunity, the
present invention provided a significantly high percentage of transgenic
plants which were symptom-free and free of the infective virus.
Resistance of these plants under greenhouse conditions persisted
throughout their life cycle, and more importantly was inherited by their
progenies as shown above.

It was observed in the present invention that the transgenic
plants producing little, if any, TSWV-BL NP were highly resistant to
infection by the homologous isolate and other closely-related isolates
within the same serogroup of TSWV, whereas no protection was found
in those expressing high levels of the NP gene.

The biological diversity of TSWV is well documented and has
been reported to overcome the genetic resistance in cultivated plants
such as tomato. Thus, it is extremely important to develop transgenic
plants that show resistance to many strains of TSWV. The present
invention indicates that one method to do so would be to utilize the
viral NP gene to confer this resistance, and that this resistance would
be to diverse TSWV isolates. Thus, the finding of the present invention
that the expression of TSWV NP gene is capable of conferring high
levels of resistance to various TSWV isolates has a great deal of
commercial importance.

In another series of studies, plasmid BIN19-N+ was constructed
and transferred to A. tumefaciens strain LBA4404 in accordance with
Example IV, and transferred to Nicotiana benthamiana in accordance
with Example V. The nucleocapsid genes of INSV-Beg and -LI were
amplified with oligomer primers INSV-A
(5'-TACTTATCTAGAACCATGGACAAAGCAAAGATTACCAAGG) and INSV-B
(5'-TACAGTGGATCCATGGTTATTTCAAATAATTTATAAAAGCAC),
hybridizing to the 5'-coding and 3'-noncoding regions of the
nucleocapsid gene of an INSV isolate, respectively. The amplified
nucleocapsid gene fragments were purified in accordance with Example
III, and digested and sequenced in accordance with Example IV.
A total of 24 N+ (transformed with pBIN19-N+) and 18 N-
(transformed with vector pBIN19) transgenic N. benthamiana plants
were transferred to soil and grown in the greenhouse. All N+ lines were
confirmed by PCR at leaf stages 4-5 to contain the N gene sequence.
The relative level of N protein accumulation was estimated in each
independent R0 transgenic clonal line by DAS-ELISA using antibodies to
the TSWV-BL N protein. Of the twenty-four N+ lines, two had OD405nm
readings of 0.50-1.00, seventeen between 0.02-0.10, and the remaining
five less than 0.02. Healthy N. benthamiana or transgenic N- plants
gave OD405nm readings of 0.00-0.02. All the R0 plants were
self-pollinated and the seeds from the following transgenic lines were
germinated on kanamycin (300 µg/ml) selection medium for inoculation
tests: (1) N--2 and -6, control transgenic lines containing vector
pBIN19 alone; (2) N+-28, a transgenic line that produced an
undetectable amount of the N protein (OD405nm = 0.005); (3) N+-21, a
transgenic line producing a low level of the N protein (OD405nm =
0.085); and (4) N+-34 and -37, two transgenic lines accumulating high
levels of the N protein (OD405nm = 0.50-1.00). These six lines were
then analyzed by Northern hybridization; the intensity of N gene
transcripts correlated well with the levels of ELISA reactions.

Transgenic seedlings from the six R0 lines were selected by
germinating seeds on kanamycin selection medium, and these seedlings
were inoculated with the five Tospoviruses. The inoculated R1 plants
were rated susceptible if virus symptoms were observed on
uninoculated leaves. In order to exclude the possibility of escapes,
transgenic control N- plants were always used in each inoculation of
transgenic N+ plants. In addition, each inoculum extract was always
used to first inoculate N+ plants followed by control N- plants. The
results from this series of studies are depicted below:

Reactions of R1 plants expressing the nucleocapsid (N) protein gene of
tomato spotted wilt virus (TSWV) to inoculation with Tospoviruses

                      No. plants infected/No. plants inoculated b
                      TSWV isolate          INSV isolate
R0 line    ELISA a    BL         10W        Beg        LI         TSWV-B
N--2/-6    <0.02      32/32      32/32      32/32      20/20      32/32
N+-28      0.005      16/16      16/16      15/16                 16/16
N+-21      0.085      9/40       17/40      39/40      18/20      40/40
N+-34      0.715      25/28 c    28/28      23/28 c               28/28
N+-37      0.510      26/28 c    22/22      21/28 c    16/20 c    22/22

a ELISA data of R0 lines from which the R1 plants were derived;

b 30-fold diluted leaf extracts of infected N. benthamiana plants were
applied to the three leaves of plants at the 3-5 leaf stages. Each
extract was always used to inoculate N+ plants followed by control N-
plants. Data were taken daily for at least two months after inoculation
and expressed as number of plants systemically infected/number of
plants inoculated;

c indicates that nearly all susceptible R1 plants displayed a
significant delay of symptom appearance.

As depicted in the above table, all R1 plants from control lines
N--2 and -6 showed systemic symptoms 5-8 days after inoculation
with all the viruses tested. None of the R1 plants from line N+-28
produced detectable levels of the N protein, and all were susceptible to
these viruses except for one plant inoculated with INSV-Beg. ELISA
assays of leaf discs from this N+-28 R1 plant sampled before
inoculation clearly showed that the plant identified to possess the
INSV-Beg resistant phenotype did accumulate a high level of the N
protein (OD405nm = 0.76 as compared to OD405nm <0.02 for all other
N+-28 R1 plants).

The low N gene expressing line N+-21 showed the best resistance
against the homologous (78%) and closely related TSWV-10W (57%)
isolates and very little resistance to the two INSV isolates (3% and
10%); only three N+-21 plants showed the resistant phenotype when
inoculated with the INSV isolates. Leaf samples from these
INSV-resistant N+-21 R1 plants gave much higher ELISA reactions
(OD405nm 0.5 to 1.00) and thus higher amounts of the N protein than the
susceptible N+-21 plants (OD405nm 0.02 to 0.20). The high N gene
expressing lines N+-34 and -37 showed the highest resistance to INSV
isolates (18%-25%) followed by the homologous TSWV-BL isolate (7%
and 11%), while none of the plants showed resistance to TSWV-10W;
however, the N+-34 and -37 R1 plants that became infected with INSV
or TSWV-BL did show various lengths of delays in symptom expression.
None of the R1 plants from these four transgenic N+ lines were
resistant to TSWV-B; some of the R1 plants from the N+-34 and -37
lines showed a slight delay of symptom appearance.
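
The resistance percentages quoted for the individual lines correspond
to the fraction of inoculated R1 plants that did not become
systemically infected. A minimal sketch of that calculation for line
N+-21, using the counts from the table above, is given below. The
function name percent_resistant is illustrative, and the values are
printed without rounding, so 77.5% and 57.5% correspond to the 78% and
57% stated in the text.

```python
# (plants infected, plants inoculated) for line N+-21, from the table above.
N21_COUNTS = {
    "TSWV-BL":  (9, 40),
    "TSWV-10W": (17, 40),
    "INSV-Beg": (39, 40),
    "INSV-LI":  (18, 20),
    "TSWV-B":   (40, 40),
}


def percent_resistant(infected, inoculated):
    """Percent of inoculated plants that did not become infected."""
    if not 0 <= infected <= inoculated or inoculated == 0:
        raise ValueError("counts must satisfy 0 <= infected <= inoculated > 0")
    return 100.0 * (inoculated - infected) / inoculated


for isolate, (infected, inoculated) in N21_COUNTS.items():
    print(f"N+-21 vs {isolate}: "
          f"{percent_resistant(infected, inoculated):.1f}% resistant")
```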
In studies to determine whether the level of N protein production
in N+ R1 plants was associated with resistance to different
Tospoviruses, the inoculated N+ R1 plants in the preceding table were
re-organized into four groups based on the intensity of their ELISA
reactions of tissues taken before inoculation, irrespective of original
R0 plants. The N+ R1 plants that expressed low levels of the N protein
(0.02-0.2 OD) showed high resistance (100% and 80%) to TSWV-BL and
-10W but were all susceptible to INSV-Beg and -LI, showing no
detectable delay in symptom expression relative to control N- plants.
In contrast, nearly all N+ R1 plants with high levels of the N protein
(0.20-1.00 OD) showed various levels of protection against TSWV-BL,
INSV-Beg and -LI, ranging from a short delay of symptom expression to
complete resistance, with most of these plants showing various lengths
of delay in symptom development relative to control N- plants. No
protection was observed in the high expressors against TSWV-10W. In
addition, none of the N+ R1 plants were resistant to TSWV-B regardless
of the level of N gene expression; however, a short delay in symptom
appearance was observed in the N+ R1 plants producing high levels of
the N protein. All control N- R1 plants and transgenic N+ R1 plants with
undetectable ELISA reactions (0 to 0.02 OD) were susceptible to all the
Tospoviruses tested.
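
The grouping of individual plants by ELISA reaction can be expressed as
a simple threshold rule. The sketch below uses the OD ranges stated in
this description; the handling of readings that fall exactly on a
boundary (0.02 or 0.20) is not specified in the text and is an
assumption of the sketch, as is the function name classify_elisa.

```python
def classify_elisa(od405):
    """Assign an N+ plant to an expression group from its OD405nm reading.

    Ranges follow the description: undetectable (0 to 0.02), low
    (0.02 to 0.2) and high (0.20 to 1.00). Boundary readings are
    assigned to the higher group, which is an assumption.
    """
    if od405 < 0:
        raise ValueError("OD reading cannot be negative")
    if od405 < 0.02:
        return "undetectable"
    if od405 < 0.20:
        return "low"
    return "high"


# R0 line readings reported in the description.
R0_READINGS = {"N+-28": 0.005, "N+-21": 0.085, "N+-34": 0.715, "N+-37": 0.510}
for line, od in R0_READINGS.items():
    print(line, od, classify_elisa(od))
```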

The inhibition of replication of a distantly related INSV in N.
benthamiana protoplasts expressing the TSWV-BL nucleocapsid gene
was also studied. In these studies, whole INSV-LI virions were used to
infect protoplasts that were isolated from three transgenic lines to
investigate how the products of the transgene affect replication of the
incoming virus. Viral replication was determined by measuring the
level of the N protein of the infecting INSV in transgenic protoplasts
using antibodies specific to the INSV N protein. DAS-ELISA analysis
showed that all progenies from a given line were relatively uniform and
nearly all R1 progeny gave an expression level of transgenic N gene
similar to their parental transgenic line. These results allowed for the
prediction of the expression level of R1 populations based on that of
their parental lines. Protoplasts derived from R1 plants of the low
expressor line N+-21 supported the replication of INSV-LI, whereas
protoplasts from R1 plants of the higher expressor line N+-37 did not
until 42 hours after inoculation, at which low levels of viral
replication were observed. The same protoplasts at various time
intervals (e.g., 0, 19, 30 and 42 hours) were also assayed by DAS-ELISA
using antibodies specific to the TSWV-BL N protein to monitor the
expression level of the transgene. As expected, protoplasts from N+-21
R1 plants produced relatively low levels (0.338-0.395 OD405nm) whereas
protoplasts from N+-37 R1 plants accumulated high levels (0.822-0.865
OD405nm). The expression level was found to be consistent at all time
points.

In this aspect of the present invention it has been shown that
transgenic N. benthamiana plants that accumulate low amounts of the
TSWV-BL N protein are highly resistant to the homologous and closely
related (TSWV-10W) isolates, while plants that accumulate high
amounts of this protein possess moderate levels of protection against
both the homologous and distantly related (INSV-Beg and INSV-LI)
viruses. More importantly, these findings indicate that transgenic N.
benthamiana plants (a systemic host of INSV) are protected against
INSV-Beg and INSV-LI isolates.

As discussed above, we have shown that transgenic plants
expressing the N gene of TSWV are resistant to homologous isolates,
and that such plants expressing the TSWV-BL N gene are resistant to
both TSWV and INSV. It has also been shown that the best resistance to
homologous and closely related isolates was found in transgenic plants
accumulating low levels of N protein, while transgenic plants with high
levels of TSWV-BL N protein were more resistant to serologically
distant INSV isolates. This observation led us to question the role of
the translated N protein product in the observed protection against
homologous and closely related isolates and to speculate that either the
N gene itself, which was inserted into the plant genome, or its
transcript was involved in the protection. To test this hypothesis,
transgenic plants containing the promoterless N gene or expressing the
sense or antisense untranslatable N coding sequence were produced. What
was discovered was that both sense and antisense untranslatable N gene
RNAs provided protection against homologous and closely related
isolates, and that these RNA-mediated protections were most effective
in plants that synthesized low levels of the respective RNA species and
appear to be achieved through the inhibition of viral replication.

More specifically, the coding sequences introduced into
transgenic plants are shown in figure 7. As depicted, the construct
pBIN19-N contains the promoterless N gene inserted into the plant
transformation vector pBIN19 (see Example IV). All other constructs
contain a double 35S promoter of CaMV, a 5'-untranslated leader
sequence of alfalfa mosaic virus and a 3'-untranslated/polyadenylation
sequence of the nopaline synthase gene. pBI525 is a plant expression
vector and is used in this study as a control; pBI525-mN contains the
mutant (untranslatable) form of the N gene; pBI525-asN contains the
antisense form of the untranslatable N gene. One nucleotide deletion at
the 5'-terminus of the mutant N gene is indicated by the dash symbol.
ATG codons are underlined and inframe termination codons in the
mutant gene are shown in bold.

EXAMPLE VIII

Primer-directed mutagenesis and cloning of the TSWV-BL N gene 
was conducted as follows: 

Full-length N gene was obtained by reverse transcription and
polymerase chain reaction as described in Phytopathology 82:1223
(1992), the disclosure of which is incorporated in toto herein. The
untranslatable N coding sequence was similarly generated by RT-PCR
using oligomer primers A
(AGCATTGGATCCATGGTTAACACACTAAGCAAGCAC), which is identical to
the S RNA in the 3'-noncoding region of the TSWV-BL N gene, and B
(AGCTAATCTAGAACCATGGATGACTCACTAAGGAAAGCATTGTTGC),
complementary to the S RNA in the 5'-terminus of the N gene. The
latter oligomer primer contains a frameshift mutation immediately
after the translation initiation codon and several termination codons to
block possible translation readthroughs. The intact and mutant N gene
fragments were purified on a 1.2% agarose gel as described in Example
II. The gel-isolated intact and mutant N gene fragments were digested
with the appropriate restriction enzyme(s) and directly cloned into
BamHI/XbaI-digested plant transformation vector pBIN19 and
NcoI-digested plant expression vector pBI525, respectively, as
described in Example IV. The resulting plasmids were identified and
designated as pBIN19-N containing the intact, promoterless N gene, and
pBI525-mN and pBI525-asN containing the mutant coding sequence in the
sense and antisense orientations, respectively, relative to the
cauliflower mosaic virus 35S promoter. The translatability of the
mutant N coding sequence in the expression cassette was checked by
transient expression assay in Nicotiana tabacum protoplasts; the
expression cassettes containing the sense or antisense mutant N coding
sequence were then excised from plasmid pBI525 by a partial digestion
with HindIII/EcoRI (since the N coding sequence contains internal
HindIII and EcoRI sites), and ligated into the plant transformation
vector pBIN19 that had been cut with the same enzymes. The resulting
vectors as well as pBIN19-N were transferred to A. tumefaciens strain
LBA4404 using the procedure described in Example IV. Leaf discs of N.
tabacum var Havana cv 423 were inoculated with the A. tumefaciens
strain LBA4404 containing various constructs and the resulting
transgenic plants were self-pollinated and seeds were selectively
germinated on kanamycin medium.

PCR was performed on each R0 transgenic line as described
above. The oligomer primers A and B were used to determine the
presence of the N coding sequence of TSWV-BL. The oligomer primer
35S-promoter (GCCACTATCCTTCGCAAGACCC) was combined with either
the oligomer primer A or B to confirm the orientation (relative to the
CaMV 35S promoter) of the mutant N coding sequence inserted into the
plant genome. DAS-ELISA used to detect the N protein in transgenic
plants was performed using polyclonal antibodies against the TSWV-BL
N protein. For an estimation of RNA transcript level in transgenic
plants by Northern blot, total plant RNAs were isolated according to
Napoli [see The Plant Cell 2:279 (1990)], and were separated on a
formaldehyde-containing agarose gel (10 µg/lane). The agarose gels
were then stained with ethidium bromide to ensure uniformity of
total plant RNAs in each lane. Hybridization conditions were as
described in the GeneScreen Plus protocol by the manufacturer.
Resulting signal blots were compared and normalized based on the N
gene transcript band of the control lane (the mN R1 plant producing a
high level of the N gene transcript) included in each blot. The
transgenic plants that gave density readings (Hewlett ScanJet and Image
Analysis Program) between 100 and 150 were rated as high expressors,
while the plants with densities between 15 and 50 were rated as low
expressors.
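
The rating rule just described can be written as a small function. The sketch below is illustrative only: the function name and the "unclassified" label for readings that fall outside both ranges are assumptions, since the text does not say how such readings were treated.

```python
def rate_expressor(density: float) -> str:
    """Rate a line from its normalized Northern blot density reading.

    Thresholds follow the text: 100-150 is a high expressor and
    15-50 is a low expressor. Readings outside both ranges are
    labeled 'unclassified', which is an assumption of this sketch.
    """
    if 100 <= density <= 150:
        return "high"
    if 15 <= density <= 50:
        return "low"
    return "unclassified"


if __name__ == "__main__":
    # Hypothetical readings, chosen only to exercise each branch.
    for reading in (120, 30, 75):
        print(reading, rate_expressor(reading))
```

Keeping the two ranges explicit, rather than using a single cutoff, mirrors the gap the text leaves between 50 and 100.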

Inoculation of transgenic plants with Tospovirus was carried out
as described above, with inoculation being done at the 3-4 leaf stage
except where indicated.

Tobacco protoplasts were prepared from surface-sterilized
leaves derived from R1 plants [see Z. Pflanzenphysiol. 78:453 (1992)
with modifications]. The isolated protoplasts (6 x 10^6 protoplasts)
were transformed with 0.68 OD260nm of the purified TSWV-BL virion
preparation using the PEG method [see Plant Mol. Biol. 8:363 (1987)].
The transformed protoplasts were then cultured at the final density of
1 x 10^6 protoplasts/ml in the culture medium at 26°C in the dark.
After various intervals of incubation, the cultured protoplasts were
washed twice with W5 solution and lysed by osmotic shock in the
enzyme conjugate buffer. Viral multiplication (replication) was
estimated by measuring the N protein of the virus using DAS-ELISA.

As described, one aspect of the present invention demonstrated
that transgenic tobacco producing no or barely detectable amounts of
the N protein was resistant to homologous and closely related isolates.
This result suggested that the observed resistance may have been due
to trans interactions of the incoming viral N gene RNA with either the N
gene transcript produced in the transgenic plants or the N coding
sequence itself. To test whether the presence of the nuclear N gene
plays a role, transgenic P°N R0 lines and R1 plants from two P°N lines
were challenged with four tospoviruses (TSWV-BL, TSWV-10W, INSV-Beg
and TSWV-B). Only asymptomatic plants were rated resistant, while
plants showing any symptoms were rated susceptible. All inoculated R0
and R1 plants were susceptible to the viruses.

To further test the possibility that the transcript of the N
transgene is involved in the protection, a number of R0 transgenic
plants that produced either the sense or the antisense N gene transcript
but not the N protein were inoculated with the homologous isolate.

Results appear in the following table:

Form of       Level of N     No. of R0      No. of lines    No. of lines
transgene a   gene RNA b     lines tested   inoculated c    resistant

mN            H              8              4               0
              L              17             16              16
              nd             4              1               0
asN           H              6              3               0
              L              9              5               5
              nd             1              0               0
P°N           nd             12             6               0

a mN and asN represent plants expressing the sense and antisense untranslatable N
genes, respectively; P°N represents plants containing the promoterless N gene;
b the level of the N gene RNA was estimated in each line by Northern blots; nd
indicates that the N gene transcript was not detected;
c 30-fold diluted leaf extracts of the N. benthamiana plants infected with TSWV-BL
were applied to three leaves of each plant at the 6-7 leaf stage. Each extract was
first applied to all test plants followed by control healthy plants. Data were taken
daily for 45 days after inoculation and only the asymptomatic plants were rated
resistant.
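
The counts in the table above can be tallied per transgene form to check the totals quoted in the text. This is a minimal sketch: the dictionary layout, the function names and the ASCII key "PN" (standing in for the promoterless construct) are assumptions made for illustration, and the numbers are simply transcribed from the table.

```python
# (transgene form, RNA level) -> (lines inoculated, lines resistant),
# transcribed from the table. "PN" is an ASCII stand-in for the
# promoterless N construct; H = high, L = low, nd = not detected.
R0_RESULTS = {
    ("mN", "H"): (4, 0),
    ("mN", "L"): (16, 16),
    ("mN", "nd"): (1, 0),
    ("asN", "H"): (3, 0),
    ("asN", "L"): (5, 5),
    ("asN", "nd"): (0, 0),
    ("PN", "nd"): (6, 0),
}


def totals_by_transgene(results):
    """Sum inoculated and resistant line counts for each transgene form."""
    totals = {}
    for (transgene, _level), (inoculated, resistant) in results.items():
        ino, res = totals.get(transgene, (0, 0))
        totals[transgene] = (ino + inoculated, res + resistant)
    return totals


def levels_with_resistance(results):
    """Return the RNA levels at which at least one line was resistant."""
    return {level for (_t, level), (_i, res) in results.items() if res > 0}


if __name__ == "__main__":
    print(totals_by_transgene(R0_RESULTS))
    print(levels_with_resistance(R0_RESULTS))
```

Grouping by RNA level as well as by construct makes it easy to see that every resistant line in this table falls in the low-expressor class.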

Unlike the controls, which developed typical systemic symptoms
7 to 9 days after inoculation, 16 out of 21 mN plants and 5 out of 8 asN
plants were asymptomatic throughout their life cycles. Northern blot
analysis of leaf tissues sampled before inoculation showed that all the
resistant R0 lines produced low levels of the sense or antisense N gene
RNA, whereas the susceptible R0 lines produced either none or high
levels of the RNA species. Since these data suggested that the
resistance of transgenic plants to TSWV-BL was related to their
relative levels of N gene transcript, transgenic progenies from four mN
and three asN R0 lines with either high or low N gene transcript levels
were selected by germination on kanamycin-containing media. These
transgenic plants were tested for resistance to the four Tospoviruses
at the 3 to 4 leaf stage, except that some R1 plants from two asN lines
were inoculated at the 6 to 7 leaf stage. The results are summarized in
the following table:

All R1 plants from high expressor lines mN-2 and mN-7 were
susceptible to infections by all tospoviruses tested, and these plants
did not show a delay of symptom appearance as compared to controls.
In contrast, high proportions of the R1 plants from low expressor lines
mN-13 and -18 were resistant to homologous (TSWV-BL) and closely
related (TSWV-10W) isolates, but not resistant to infections by
distantly related tospoviruses (INSV-Beg and TSWV-B). The resistance
of asN R1 plants from low expressor R0 lines was markedly influenced
by the TSWV isolate used for inoculation. All but one of the small R1
plants (3-4 leaf stage) from low expressor lines asN-1 and -9 became
infected, although there was a delay of symptom appearance, when
inoculated with the homologous TSWV-BL or closely related TSWV-10W
isolates. In contrast, most of the large R1 plants (6-7 leaf stage) from
line asN-9 were resistant to both isolates. In comparison, control R1
plants and R1 plants from a high expressor line such as asN-4
displayed no resistance to either of the isolates regardless of the size
of test plants. Antisense RNA-mediated protection was not effective
against infection by the distantly related INSV-Beg and TSWV-B
isolates.

Analyses of data presented in the above two tables suggest that
sense and antisense RNA-mediated protections are observed only in low
expressors of the N gene. The R1 asN plants that produced high levels
of the antisense N gene transcript were as susceptible as control
plants. In contrast, the asN low expressors displayed a delay in
symptom appearance when inoculated at the 3-4 leaf stage and showed
increased levels of resistance when inoculated at the 6-7 leaf stage.

Inhibition of viral replication in tobacco protoplasts expressing
the sense or antisense form of untranslatable N coding sequence was
also noted. In this instance, whole virion preparations of TSWV-BL
were used to transfect protoplasts isolated from transgenic lines to
investigate the effect of sense or antisense N gene transcript on
replication of the incoming virus. Viral replication was determined by
measuring the level of the N protein of the incoming virus in
transfected protoplasts, and it was found that protoplasts derived from
plants (mN-7 and asN-4) that produced high levels of the respective
RNA transcripts supported the replication of the virus, whereas
protoplasts from an mN low expressor (mN-18) did not. Protoplasts from
an asN low expressor (asN-9) supported much lower levels of viral
replication.

Accordingly, in this aspect of the present invention we have
shown that transgenic plants expressing the sense or antisense form of
untranslatable N gene coding sequence are resistant to homologous
(TSWV-BL) and closely related (TSWV-10W), but not to distantly
related (INSV-Beg and TSWV-B) tospoviruses. The following table
provides a comparison of resistance to Tospoviruses between
transgenic tobacco expressing various forms of the TSWV-BL N gene:

                                    Form of the Transgene a
              Homology to
Tospovirus    TSWV-BL N Gene b      N       mN      asN     P°N
TSWV-BL       100%                  R       R       R c     S
TSWV-10W      99%                   R       R       R c     S
INSV-Beg      60%                   R c     S       S       S
TSWV-B        78%                   S       S       S       S

a reactions of transgenic tobacco and N. benthamiana plants expressing the intact N
gene (N) of TSWV-BL to inoculation with the four Tospoviruses are included for
comparisons with indexation results of transgenic plants containing untranslatable
(mN), antisense (asN), and promoterless (P°N) N coding sequences; R = resistant,
S = susceptible;
b the nucleotide sequences are as reported in Phytopathology 82:1223 (1992) and
Phytopathology 83:728 (1993);
c level of resistance may depend upon the concentration of inoculum.
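
The comparison table above can be queried to separate protection that does not require the N protein from protection seen only with the intact gene. The following is a hedged sketch: the data structure, the function names and the ASCII key "PN" for the promoterless construct are assumptions introduced for illustration, and the footnote-c caveat about inoculum concentration is not modeled.

```python
# Resistance calls transcribed from the table (R = resistant,
# S = susceptible). "N", "mN", "asN" and "PN" stand for the intact,
# untranslatable, antisense and promoterless forms respectively.
RESISTANCE = {
    "TSWV-BL":  {"N": "R", "mN": "R", "asN": "R", "PN": "S"},
    "TSWV-10W": {"N": "R", "mN": "R", "asN": "R", "PN": "S"},
    "INSV-Beg": {"N": "R", "mN": "S", "asN": "S", "PN": "S"},
    "TSWV-B":   {"N": "S", "mN": "S", "asN": "S", "PN": "S"},
}


def protected_without_protein(virus: str) -> bool:
    """True if an untranslatable form (mN or asN) confers resistance."""
    row = RESISTANCE[virus]
    return row["mN"] == "R" or row["asN"] == "R"


def protected_only_by_intact_gene(virus: str) -> bool:
    """True if only the intact, translatable N gene confers resistance."""
    row = RESISTANCE[virus]
    return row["N"] == "R" and not protected_without_protein(virus)


if __name__ == "__main__":
    for name in RESISTANCE:
        print(name, protected_without_protein(name),
              protected_only_by_intact_gene(name))
```

Under these assumptions, INSV-Beg is the only isolate flagged by the second function, which matches the row pattern in the table.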

These results confirm and extend the earlier aspects of the
present invention for RNA-mediated protection with TSWV.
Furthermore, the protection is observed in plants producing low rather
than high levels of the N gene transcript. Although earlier studies
reported herein indicate that tobacco plants which produced high levels
of the TSWV-BL N protein displayed resistance to INSV-Beg, resistance
to INSV-Beg was not observed in transgenic plants expressing the sense
or antisense form of the untranslatable N gene, which clearly indicates
that protection against INSV-Beg is due to the presence of the N
protein and not the N gene transcript. Thus, it appears that two
different mechanisms are involved in protecting transgenic plants
against TSWV and INSV tospoviruses according to the present invention.
One mechanism involves the N gene transcript (RNA-mediated), and
another involves the N protein (protein-mediated). In addition, the
results of the protoplast experiments indicate that N gene RNA-mediated
protection is achieved through a process that inhibits viral
replication, and the data contained in the above tables suggest that
protection against the distantly related INSV-Beg isolate is conferred
by the N protein of TSWV-BL, and not by the gene transcript.

Finally, further studies were conducted to provide still another
aspect of the present invention: that a portion of the Tospovirus
nucleoprotein gene provides protection of transgenic plants against
infection by the Tospovirus. It has already been demonstrated above
that the N gene RNA protects against homologous and closely related
TSWV isolates while the N protein protects against the homologous
isolate and distantly related INSV isolates; that N gene RNA-mediated
protection is effective in plants expressing low levels of the N gene
whereas N protein-mediated protection requires high levels of N protein
accumulation; and that the N gene RNA-mediated protection is achieved
through inhibition of viral replication. Based upon this prior data, we
next set out to determine whether a portion of the N gene might work
against infection by the virus. We found, as discussed below, that
transgenic plants expressing about one-half of the N gene sequence are
resistant to the virus.

The following describes the cloning of one-half N gene fragments
of TSWV-BL in order to demonstrate this final aspect of the present
invention. The first and second halves of both the translatable and
untranslatable N gene were generated by reverse transcription and
then PCR as described above. As depicted in figure 8, the nucleotide
deletion or insertions at the 5'-terminals of the untranslatable half N
gene fragments are indicated by the dash symbol; ATG codons are
underlined and all possible termination codons immediately after the
initiation codon of the untranslatable half N gene fragments are shown
in bold.

The first half of the N gene was produced by RT-PCR using
oligomer primers I (5'-TAGAGTGGATCCATGGTTAAGGTAATCCATAGGCTTGAC),
which is complementary to the central region of the TSWV-BL N gene,
and II (5'-AGCTAACCATGGTTAAGCTCACTAAGGAAAGCATTGTTGC) for the
translatable or III
(5'-AGCTAATCTAGAACCATGGATGACTCACTAAGGAAAGCATTGTTGC) for
the untranslatable first half N gene fragment; the latter two oligomer
primers are identical to the 5'-terminus of the N gene. Similarly, the
second half of the N gene was produced by RT-PCR using oligomer
primers IV (5'-AGCATTGGATCCATGGTTAACACACTAAGCAAGCAC), which
is complementary to the 3'-noncoding region of the TSWV-BL N gene,
and V (5'-TACAGTTCTAGAACCATGGATGATGCAAAGTCTGTGAGG) for the
translatable or VI
(5'-AGATTCTCTAGACCATGGTGACTTGATGAGCAAAGTCTGTGAGGCTTGC)
for the untranslatable second half N gene fragment; the latter two
oligomer primers are identical to the central region of the N gene. The
oligomer primer III contains a frameshift mutation immediately after
the translation initiation codon and several termination codons to
block possible translation readthroughs, while the oligomer primer VI
contains several inframe termination codons immediately after the
translation initiation codon.
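
The effect of a frameshift immediately after the initiation codon can be illustrated with a short reading-frame scan. This is only a sketch under stated assumptions: the function name is invented for illustration, and the two sequences are short hypothetical strings, not the primers or the N gene sequence of this disclosure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_inframe_stop(seq: str):
    """Locate the first ATG and the first stop codon in its reading frame.

    Returns (atg_index, stop_index) as 0-based positions, or None when
    there is no ATG or no in-frame stop codon after it.
    """
    seq = seq.upper()
    start = seq.find("ATG")
    if start == -1:
        return None
    # Walk codon by codon from the triplet following the ATG.
    for i in range(start + 3, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return start, i
    return None


if __name__ == "__main__":
    # Hypothetical toy sequence: a TGA is present but out of frame.
    intact = "GGATGGCACTGAAACCGTTA"
    # Deleting the single base right after the ATG shifts the frame,
    # which brings that TGA into frame.
    mutant = intact[:5] + intact[6:]
    print(first_inframe_stop(intact))
    print(first_inframe_stop(mutant))
```

A check of this kind only shows that a stop codon lies in frame; as the text notes, translatability itself was assessed experimentally.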

The half gene fragments were purified on a 1.2% agarose gel as
described above, and the gel-isolated gene fragments were digested
with the restriction enzyme NcoI and directly cloned into NcoI-digested
plant expression vector pBI525. The resulting plasmids were
identified and designated as (1) pBI525-1N containing the first half
translatable N gene, (2) pBI525-1N' containing the first half
untranslatable N gene, (3) pBI525-1N- containing the first half
translatable N gene in the antisense orientation, (4) pBI525-2N
containing the second half translatable N gene, (5) pBI525-2N'
containing the second half untranslatable N gene, and (6) pBI525-2N-
containing the second half translatable N gene in the antisense
orientation. The expression cassettes were then excised from plasmid
pBI525 by digestion with HindIII/EcoRI and ligated as described above
into the plant transformation vector pBIN19 that had been cut with the
same enzymes. The resulting vectors as well as plasmid pBIN19 were
transferred to A. tumefaciens strain LBA4404, using the procedure
described by Holsters supra. Leaf discs of N. benthamiana were
inoculated with A. tumefaciens strain LBA4404 containing the various
constructs. Transgenic plants were self-pollinated and seeds were
selectively germinated on kanamycin as described above.

Analysis of transgenic plants by PCR and Northern hybridization

PCR was performed on each R0 transgenic line as described previously.
The oligomer primers I to VI were used to determine the presence of the
N coding sequence of TSWV-BL. The oligomer primer 35S-Promoter (see
Example VIII) was combined with one of the above oligomer primers to
confirm the orientation (relative to the CaMV 35S promoter) of the half
gene sequences inserted into the plant genome. Northern analysis was
conducted as described in Example VIII.

Lettuce isolate of TSWV (TSWV-BL) was used to challenge
transgenic plants. Inoculation was done using test plants at the 3-4
leaf stage as described above. To avoid the possibility of escapes,
control plants were used in each experiment and each inoculum extract
was used to first inoculate the transgenic plants followed by control
plants.

The various constructs used in this aspect of the present
invention are illustrated in figure 6. Translatable and untranslatable
half N gene fragments were synthesized by RT-PCR and then cloned
directly into the plant expression vector pBI525. The oligomer primers
III and VI, used for generation of untranslatable half N gene fragments
by RT-PCR, contain a mutation immediately after the translation
initiation codon and the resulting reading frame contains several
termination codons to block possible translation readthroughs. Thus,
both first and second half untranslatable N gene fragments should be
incapable of producing the truncated N protein fragments when
introduced into plants. Both translatable and untranslatable half N gene
fragments were then placed downstream of the CaMV 35S promoter of
the vector pBI525 in the sense orientation or in the antisense
orientation. The expression of the half N coding sequences of TSWV-BL
was thus controlled by a double CaMV 35S promoter fused to the
5'-untranslated leader sequence of alfalfa mosaic virus (ALMV) of the
expression vector pBI525. Expression vectors that utilize the stacked
double CaMV 35S promoter elements are known to yield higher levels of
mRNA transcription than similar vectors with a single 35S promoter
element. Expression cassettes were transferred from the vector
pBI525 to the plant transformation vector pBIN19. The resulting
plasmids as well as the control plasmid pBIN19 were then transferred
into A. tumefaciens strain LBA4404. Transgenic plants were obtained
with nomenclature of the transgenic lines shown in figure 6.

All the kanamycin-resistant transgenic lines were confirmed by
PCR to contain the proper N coding sequences in the expected
orientations. Each transgenic R0 line which was grown for seeds was
then assayed using Northern blot. Six out of six 1N, four out of six 1N',
six out of six 1N-, six out of six 2N, seven out of eight 2N', and six out
of seven 2N- transgenic R0 lines were found to produce half N gene
RNAs.

A set of transgenic R0 plants was challenged with the
homologous isolate TSWV-BL. Only asymptomatic plants were rated
resistant while the plants showing any symptom (local lesions or
systemic infections) were rated susceptible. All the inoculated R0
control plants were susceptible to the virus; in contrast, two out of
nine 1N', two out of six 1N-, four out of ten 2N', and one out of eight 2N-
R0 lines were found to be completely resistant to the virus infection.
Although none of the 1N and 2N R0 lines showed high levels of
resistance, some of those plants displayed significant delays of
symptom appearance.

Another set of transgenic R0 lines was brought to maturity for
seed production. Seedlings were germinated on kanamycin-containing
medium and inoculated with TSWV-BL. As shown in the following table,
control seedlings and seedlings from some of the transgenic lines were
susceptible to the isolate whereas seedlings from lines 1N-151,
1N'-123, and 2N'-134 showed various levels of protection, ranging from
delays in symptom expression to complete resistance.

No. plants infected/No. plants inoculated

R0 line      BDP!      15DPI     30DPI

Control      50/50
1N-149       17/17
1N-151       2/20      13/20     17/20
1N'-123      16/20     17/20     17/20
1N'-124      20/20
1N'-126      19/19
1N--130      12/15     15/15
1N--132      18/19     19/19
2N-155       20/20
2N'-134      0/20      10/20     10/20
2N'-135      19/19
2N--142      20/20
2N--143      20/20

In the above table, diluted extracts of infected N. benthamiana were
used to inoculate transgenic plants at the 3-4 leaf stage followed by control
transgenic plants. DPI = days post inoculation.

In summary, this aspect of the present invention shows that
transgenic plants expressing the first or the second half of either
translatable or untranslatable N gene fragment are highly resistant to
the homologous TSWV-BL isolate. This result demonstrates that a
portion of the N gene is sufficient for resistance to the virus.

A listing of all nucleotide and amino acid sequences described in
the foregoing description of the present invention is as follows:

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Dennis Gonsalves and Sheng-Zhi Pang

(ii) TITLE OF INVENTION: Tomato Spotted Wilt Virus

(iii) NUMBER OF SEQUENCES: 30

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

AGCAGGCAAA ACTCGCAGAA CTTGC 25

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GCAAGTTCTG CGAGTTTTGC CTGCT 25

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

AGCTAACCAT GGTTAAGCTC ACTAAGGAAA GC 32

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

AGCATTCCAT GGTTAACACA CTAAGCAAGC AC 32

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2265 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

CAAGTTGAAA GCAACAACAG AACTGIAAAT TCTCTTGCAG TGAAATCTCT 50 

GCTGATGTCA gcagaaaaca acatcatgcc taactctcaa gcttccactg 100 

AlltTCAITr CAAGCTGAGC CICIGGCTAA GGGTTCCAAA GGITITGAAG 150

CAGGffitECA TICAGAAAiT GTTCAAGGTT GCAGGAGATG AAACAAACAA 200

AACATTiTAT TTATClMTG CCTX3CAITCC AAACCATAAC AGTGTTGAGA 250 

CAGCTflAAA GMMJM^ AiTtGCAAGC AlCAGCTCCC AA1TCGCAAA 300 

tgcAAAgctc criTiGAAri? McAAixsms rnrcrGArr taAaggagcc 350 

iTACAACArr GTTCAiXaACd GfrXAlACCC CAAAGGATCG GtTCCAATGC 400 

TCTOGGllCGA AACTCAGACA flC^frGCACA AGTTCTTTGC AACTAACTTG 450 

CAAGAAGATG TAAlX3¥lGlA GACTTTGAAC AACCTTGAGC TAACTCCTGG 500 

AAAtMAGAf ftAGGTGAAA GAACXTPGAA TTACAGTGAA GATGOCTACA 550 

AAAGGAAAlA TTiXXtfrtCA AAAACACTTG AATGTCTTCC ATCTAACACA 600 

CAAAG^ATGt CftACTlAGA GAGCATCCAA ATCCCITCAT GGAAG&TAGA 650 

CTTfGCCAGA GGAGAAATIA AAATITCTCC ACAA1CEAJT TCAGTTGCAA 700 

AATCWTGTT AAAGCflteA'f TEAAGCGGGA TCAAAAAGAA AGAATCTAAG 750 

GTTAAGGAAG CGl!AIGCrrC AGGATCAAAA TAATCTTGCT TTGTCCAGCT 800 

TiTfCEAArf AIUriM^lT TAllTlUriT CTITACTIAT AATTATTTCT 850 

CTGITTGTCA KSuiTltAA A1TCCKXTG TCTAGTAGAA ACCATAAAAA 900 

GAAAAAAiAA AAATGAAAAT AAaAITAAAA TAAAAlAAAA TCAAAAAATG 1000 

AAA£aAAAAC AACAAAAAAf 'EAAAAAACGA AAAACCAAAA. AGACCCGAAA 1050 

GGGAOGAAJrr ^GGCGAAA3*T tGGGTITTCT TlTlUrmT TGTTTTTTGT 1100 

ill'i'iM^rrf TiArttrAiT ttTArrriAr tttrititja ttttaititt 1150 

ATl*i^VlTm. Tl'i'ri'llaiTi' toGTTCTTTT TGTTATITTA TTATTTATEA. 1200 

AGCAGAACAC ACAGAAAGCA AACTTTAATT AAACACACTT ATCTAAAATT 1250 

TAAGAGACTA AQCAAGCAcA AQCAATAAAG AIAAAGAAAG CTTTAIAIAT 1300 

TrAlAGGCTT TTTTATAATT TAACTTACAG CTQCTTTCAA GCAAGITCTG 1350 

CGAGTTiTGC CTGCnTTTEA ACCCCGAACA TITCAIAGAA CITGTEAAGA 1400 

GTT f KA£FGT AAlX?rT0CAT AGCAACACTC CCTTTAGCAT TAGGATTGCT 1450 

ggAgc^aAgt AeAgcagcat Actctitccc crrcrrcAcc tcatcttcat 1500 

TCAll^CAAA TGCTrtX3CTT iTCAGCACAG TGCAAACTIT TCCTAAGGCT 1550 

1CCTTGGTGT CAIACTrCTT TGGGTCGATC CCGAGGTOCT TGTAITITGC 1600 

ATCGiiGAlAT AIAGCCAAGA CAACACTGAT CATCTCAAAG CTATCAACTG 1650 

AAGGAASAAG AGGTAAGCTA CCTCOCAGCA TIATGGCAAG TCTCACAGAC 1700 

ITIGCATCAT CGAGAGGTAA TOCAlAGGCT TGAATCAAAG GATGGGAAGC 1750 

AATfflAGAi? TTCAIAGEAT TGAgATTCTC AGAATTCOCA GTTTCITCAA 1800 

CAAGCGTGAC CCTGATCAAG CIAiGAAGCC TTCTGAAGGT CATGTCAGTG 1850 

CCTCCAATCC tgtctgAagt TTTCTTTATG GTAATTTTAC CAAAAGTAAA 1900 

ATCGGlTTGC TTAATAACCT TCATTATGCT CTGACGATTC TTEAGGAATG 1950
TCAGACATGA AAIAACGCTC ATLT1LT1GA. TCTGGTCGAT GTTTTCCAGA 2000
CAAAAAGTCT IGAAGtIGAA TQCTACCAGA. TICTGATCIT CCTCAAACTC 2050 
AAQGICTTTG CCrtXattSDCA ACAAAGCAAC AATGCTTICC TTAGTGAGCT 2100 
TAACCTIAGA CATG^IGATG GTAAAAGTTG TIA1AGCTTT GACCCTATGT 2150 
AACKAAGGT GCGAaAGTGC AACTCTGTAT CCCGCAGTCG TTICTTAQGT 2200
Tt5 i lAATC#G AllM'iUtalA AGACTGAGTG TTAACGTATG AACACAAAAT 2250 
TGACAOGATT GCtfCT 2265 

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1709 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:



AAATTCTCTT GCAGTGAAAT CTCTGCTCAT GTTAGCAGAA AACAACATCA 50
TX3CC1AACTC tcaagcttit CTCAAAGCTT CTACTGATTC TAATTTCAAG 100
CTQAQCCTCT GGClAAGGGT TCCAAAGGTT TTGAAGCAGA TTTCCATTCA 150
GAAATTGTTC MGGT/IGCAG GAGATGAAAC AAATAAAACA TTTTATTTAT 200
CTMTGCCTG CATTCCAAAC CATAACAGTG TTGAGACAGC nTAAACATT 250
ACfTVilTATiT GCAAGCATCA GCTCCCAATT CGTAAATGTA AAACTCCnT 300
TCAAfiATCA ATGATliiTiT CTGATTTAAA GGAGCCTTAC AACATTATIC 350
AltMCCiTC ATATCXTCAA AGGATTGTTC ATCCTCTGCT TGAAACTCAC 400
ACAifeTnTG CACAAGTTCT TTGCAACAAC TTGCAAGAAG ATGTGATCAT 450
cmcAccrtG MCAACCATC AGCTAACTCC TGGAAAG1TA GATTTAGGTG 500
AAAlAACttT GA&riACAAT GAAGACGCCT ACAAAAGGAA ATATTTCCTT 550
TCAAAAACAC TTGAATGTCT TCCATCTAAC ATACAAACTA TGTCTTATTT 600
AGACAGCATC OW^CCCTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA 650
rtAAAATTTC TCCACAATCT ArrrcAGTTG CAAAATCTTT GTTAAATCTT 700
GAift^IAAGCG GGATTAAAAA GAAAGAATCT AAGATTAAGG AAGCAIATGC 750
TTCAGGATCA AAATCATCTT GCTGTGTCCA GLTlTl'lUiA ATTATGTTAT 800
bT^lAi'lTlC ITlUiTlACT TATAATTA1T IT1U1UJ.T1U TCAiTTcrrr 850
CAAAttCCiDC CTGTCTAGTA GAAACCAIAA AAACAAAAAT AAAAATAAAA 900
TAAAATCAAA ATAAAATAAA Aaicaaaaaa TGAAATAAAA GCAACAAAAA 950
AAiTAAAAAA CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGCCA 1000
AATflGGGGT TTGnTlTSr TTTTTGTTTT TTTGTTTTIT GTITTTATTT 1050
TTATT1T1AT TiTlArTm' AITTIATTTT ATTTTATGTT TTTCTTGITT 1100
TTGTTATTTT GTTATTTATT AAGCACAACA CACAGAAAGCA AACTTTAAT 1150
TAAACACACT TZVITTAAAAT TTAACACACT AAGCAAGCACA AACAATAAA 1200
GATAAAGAAA GCTTIATATA ITIAIAGGCT TTITEATAAT TTMCTTACA 1250
GCTGCTTTEA AGCAAGTTCT GTtaAGTTTTG CCTbTlTlTl' MCCCCAAAC 1300
ATTTCATAGA ALUlUiTAAG QGTTTCACTG TAATGITOCA TAGCIAATACT 1350
TCCTTTAQCA TTAGGATTGC TtSGAGCTAAG TATAGCAGCA TACTCTTTOC 1400
(XTiUl'itAC CTGAICTTtA TTCAITTCAA ATGLT1T1CT TITCAGCACA 1450
GTGCAAACTT TTCTTAAGGC TTOXTGGTG TCATACTTCT iTQGGTCGAT 1500
(XCGAGATCC TTGTATTTTG CATCCIGATA TATAQCCAAG ACAACACTGA 1550
TCAiX^TCAAA GCTATCAACT GAAGCAATAA GAGGTAAGCT ACCTCOCAGC 1600
ATTATGGCAA GCCTCACAGA CIlTGCATCA TCAAGAGGTA ATOCATAGGC 1650
TTGAATCAAA GGGTGGGAAG CAAICTTAGA TTTCATAGTA TTCAGATTCT 1700
CAGAATTCC 1709

(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 260 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:



Gln Val Glu Ser Asn Asn Arg Thr Val Asn Ser Leu Ala Val Lys
5 10 15

Ser Leu Leu Met Ser Ala Glu Asn Asn Ile Met Pro Asn Ser Gln
20 25 30

Ala Ser Thr Asp Ser His Phe Lys Leu Ser Leu Trp Leu Arg Val
35 40 45

Pro Lys Val Leu Lys Gln Val Ser Ile Gln Lys Leu Phe Lys Val
50 55 60

Ala Gly Asp Glu Thr Asn Lys Thr Phe Tyr Leu Ser Ile Ala Cys
65 70 75

Ile Pro Asn His Asn Ser Val Glu Thr Ala Leu Asn Ile Thr Val
80 85 90

Ile Cys Lys His Gln Leu Pro Ile Arg Lys Cys Lys Ala Pro Phe
95 100 105

Glu Leu Ser Met Met Phe Ser Asp Leu Lys Glu Pro Tyr Asn Ile
110 115 120

Val His Asp Pro Ser Tyr Pro Lys Gly Ser Val Pro Met Leu Trp
125 130 135

Leu Glu Thr His Thr Ser Leu His Lys Phe Phe Ala Thr Asn Leu
140 145 150

Gln Glu Asp Val Ile Ile Tyr Thr Leu Asn Asn Leu Glu Leu Thr
155 160 165

Pro Gly Lys Leu Asp Leu Gly Glu Arg Thr Leu Asn Tyr Ser Glu
170 175 180

Asp Ala Tyr Lys Arg Asp Tyr Phe Leu Ser Lys Thr Leu Glu Cys
185 190 195

Leu Pro Ser Asn Thr Gln Thr Met Ser Tyr Leu Asp Ser Ile Gln
200 205 210

Ile Pro Ser Trp Lys Ile Asp Phe Ala Arg Gly Glu Ile Lys Ile
215 220 225

Ser Pro Gln Ser Ile Ser Val Ala Lys Ser Leu Leu Lys Leu Asp
230 235 240

Leu Ser Gly Ile Lys Lys Lys Glu Ser Lys Val Lys Glu Ala Tyr
245 250 255

Ala Ser Gly Ser Lys
260

(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 858 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
TTAACACACT AAGCAAGCAC AAACAATAAA GATAAAGAAA GCTTIATATA 50
TTTATAGGCT TTTITATAAT TTAACTTACA GCTGCTTT1A AGCAAGTTCT 100
GTGAGTTTTG CCTGTTrTIT AACOCCAAAC AITTCATAGA ACTTGTTAAG 150
GGTTTCACTG TAATGTTCCA TAGCAATACT TCCTTTAQCA TTAGGATTGC 200
TGGAGCTAAG TATAGCAGCA TACTCTTTCC CCTTCTTCAC CTGATCTTCA 250
TTCATTTCAA ATGCTTITCT TTTCAGCACA GTGCAAACTT TTOCTAAGGC 300
TTCCCTGGTG TCAIACTTCT TTGGGTOGAT COCGAGATCC TTGTATTTTG 350
CATraPGATA TATAGOCAAG ACAACACTGA TCATCTCAAA GCTATCAACT 400
GAAGGAATAA GAGGTAAGCT ACCTCCCAGC ATTATGGCAA GCXTCACAGA 450
CTTiGCATCA TCAAGAGGTA ATCCATAGGC TTGACTCAAA GGGTGGGAAG 500
CAATtTTAGA TTTGATAGTA TTGAGATTCT CAGAATTCCC AGTTTOCTCA 550
ACAAGCCTGA CCCTGATCAA GCTATCAAGC CTTCTGAAGG TCATGTCAGT 600
GGCTXXAATC CTGTCTGAAG TTTICTTTAT GGTAATTTTA CCAAAAGTAA 650
AATCGCTTTG CTTAATAAX TTCATTATGC TCTGACGATT CTTCAGGAAT 700
GTGAGACAIG AAAlAAlTGCT CATCTnTTG ATCTGGTCAA GGTJ.T1CCAG 750
ACAAAAAGTC TlGAASlTGA ATGCIAOCAG ATTCTGATCT TCCTCAAA.CT 800
CAAGGTCriT GCCrt^GTC AACAAAGCAA CAATGCTTTC CTIAGTGAGC 850
TTAACCAT 858
(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2028 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

AAAlSffCTCTf GCAGIGAAAf CTCTGCTCAT GTTAGCAGAA AACAACATCA 50 

TGCC^AACfC ltSAAQCfft*ri? GTCAAAGCTT CTACTGATTC TAATTTCAAG 100 

1 5 cn^ajdrcr ggctaagggt tccaaaggtt ttgaagcaga tttccattca 150 

GAAftftGTTC aaggttgcag gagatgaaac aaataaaaca ttttatttat 20b 

CnMllGCGTG CATTOCAAAC CATAACAGTG TTGAGACAGC TTTAAACAIT 250 

ACKxtlMTT GCAAGGATCA GCKXCAATT CGIAAATGTA AAACTCCTTT 300 

tgaAI'Iaica. Aiteftirsrtrr ctgAtttaaa ggagccttac aacattattc 350 

2 0 ATGAICCTTC AiATOCCCAA AGGATTGTTC ATGCTCTGCT TGAAACTCAC 400 

ACATCTTITG CACAAGrrCT TTGCAACAAC TTGCAAGAAG ATGTGATCAT 450 

ClACACCTTG AACAACCATG AGCTAACTCC TQGAAAGTTA GATTTA.GGTG 500 

AAAiAACTTT GAATTACAAT GAAGACGCCT ACAAAAGGAA ATATTTCCIT 550 

TCAAAAACAC TIGAATGTCT TCCATCTAAC ATACAAACTA TGTCTTATTT 600 

2 5 AGAGAGCATG CAAATCCCTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA 650 

TEAAAAiTTC TOCACAAltT AiTTCAGTTG CAAAATCTIT GTTAAATCTT 700 

GAlTffiAAGCG GGArtAAAAA. GAAAGAATCT AAGATTAAGG AAGCATATGC 750 

TTGAGGATCA AAATGATCTT GCTGTGTCCA GCTTTTTCTA ATEATGTTAT 800 

GTnMTITC rriCfnACT TAIAATTATT TITCTGTITG TCAIl'lCI'IT 850 

3 0 GAAATTCCTC CIGTCIAG1A GAAAOCAIAA AAACAAAAAT AAAAAIAAAA 900 

TAAAAiCAAA. AIAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA 950 

aattaaaaaA CAAAAAACCA AAAAAGATCC CGAAAGGACA ATTTTGGOCA 1000 

AATTI1GGGGT TTGTTfrTGT TTlTTGTnT TTTGTTTTTT GTTTTEATTT 1050 

TTAlTTTTAT TlTimTnT AITI'IMTIT ATITEATGTT TTIGTIGTTT 1100 

3 5 TTGTTMTIT GTIATTIA1T AAGCACAACA CACAGAAAGC AAACTTEAAT 1150 

TAAACACACT TATTEAAAAT TEAACACACT AAGCAAGCAC AAACAATAAA 1200 
GMaAAGAAA GCTTEAiATA TiTAlAGGCT TTTTTAIAAT TTOACTIACA 1250 
GCTGJrrim AGCAAGrftCT GTGAGTnTG CCTCTTTITr AACCCCAAAC 1300 
ATtfTCATAGA ALTlUi'lAAG GGTTTCACTG TAATGTTOCA TAGCAAIACT 1350 
TOCMTAGCA ttAGGAfrGG TQGAGCTAAG TATAGCAGCA TACTCTTICC 1400 
5 GuriuriCAC CTGATCfTCA iTtATITCAA ATGCTTTTCT IflTCAGCACA 1450 
GTGGAAAC1T rttCEAAGGC" iltCCTGGTG TCAIACTTCT ©GGGTCGAT 1500 
CCCGAGATCC iTteiMl'iTG CATCCTGATA TATAGCCAAG ACAACACTGA 1550 
TCMtffCAAA GCMcAACf GAAGCAATAA GAGGTAAGCT AOCTOCCAGC 1600 
AlM&CAA GCCTGACAgA CflTGCATCA TCAAGAGGTA MtCATAQGC 1650 
1 0 T^CTCAAA GQGTGGGAAG CAATCTEAGA riTGAiAGTA f^GAGATICT 1700 
CAGAAffcCC AG'lTltXTCA ACAAGCCTGA COCTGATCAA GCTATCAAGC 1750 
C^OTaAGG TCATGTCAGt GGCTOCAATC CTGTCTGAAG TTITC7ITEAT 1800 
GG^KTTA CCAAAAGTAA AATCQCTTTG CTTAATAACC TTCATTATCC 1850 
TC?fGAGGATT dTTCAGGAAT GlCAGACATG AAATAATQCT CAT CmTI G 1900 

1 5 AlCfGSTtAA GGTITTCCAG ACAAAAAGTC TTGAAGTTGA ATCCTACCAG 1950 

Arn^SATCT TOCTCAAACT CAAGGTCITT GOCTTGTGTC AACAAAGCAA 2000 
. CAAl^GGTrrc CflAG'rGAGC iTAACCAT 2028 
(2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

TrcTGGTCTT CTTCAAACT CA 22 
(2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
CTCJAQOCAT GAGCAAAG 18

(2) INFORMATION FOR SEQ ID NO:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 467 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:



Met Ser Ser Gly Val Tyr Glu Ser Ile Ile Gln Thr Lys Ala Ser
5 10 15

Val Trp Gly Ser Thr Ala Ser Gly Lys Ser Ile Val Asp Ser Tyr
20 25 30

Trp Ile Tyr Glu Phe Pro Thr Gly Ser Pro Leu Val Gln Thr Gln
35 40 45

Leu Tyr Ser Asp Ser Arg Ser Lys Ser Ser Phe Gly Tyr Thr Ser
50 55 60

Lys Ile Gly Asp Ile Pro Ala Val Glu Glu Glu Ile Leu Ser Gln
65 70 75

Asn Val His Ile Pro Val Phe Asp Asp Ile Asp Phe Ser Ile Asn
80 85 90

Ile Asn Asp Ser Phe Leu Ala Ile Ser Val Cys Ser Asn Thr Val
95 100 105

Asn Thr Asn Gly Val Lys His Gln Gly His Leu Lys Val Leu Ser
110 115 120

Leu Ala Gln Leu His Pro Phe Glu Pro Val Met Ser Arg Ser Glu
125 130 135

Ile Ala Ser Arg Phe Arg Leu Gln Glu Glu Asp Ile Ile Pro Asp
140 145 150

Asp Lys Tyr Ile Ser Ala Ala Asn Lys Gly Ser Leu Ser Cys Val
155 160 165

Lys Glu His Thr Tyr Lys Val Glu Met Ser His Asn Gln Ala Leu
170 175 180

Gly Lys Val Asn Val Leu Ser Pro Asn Arg Asn Val His Glu Trp
185 190 195

Leu Tyr Ser Phe Lys Pro Asn Glu Asn Gln Ile Glu Ser Asn Asn
200 205 210

Arg Thr Val Asn Ser Leu Ala Val Lys Ser Leu Leu Met Ala Thr
215 220 225

Glu Asn Asn Ile Met Pro Asn Ser Gln Ala Phe Val Lys Ala Ser
230 235 240

Thr Asp Ser His Phe Lys Leu Ser Leu Gln Leu Arg Ile Pro Lys
245 250 255

Val Leu Lys Gln Ile Ala Ile Gln Lys Leu Phe Lys Phe Ala Gly
260 265 270

Asp Glu Thr Gly Lys Ser Phe Tyr Leu Ser Ile Ala Cys Ile Pro
275 280 285

Asn His Asn Ser Val Glu Thr Ala Leu Asn Val Thr Val Ile Cys
290 295 300

Arg His Gln Leu Pro Ile Pro Lys Ser Lys Ala Pro Phe Glu Leu
305 310 315

Ser Met Ile Phe Ser Asp Leu Lys Glu Pro Tyr Asn Thr Val His
320 325 330

Asp Pro Ser Tyr Pro Gln Arg Ile Val His Ala Leu Leu Glu Thr
335 340 345

His Thr Ser Phe Ala Gln Val Leu Cys Asn Lys Leu Gln Glu Asp
350 355 360

Val Ile Ile Tyr Thr Ile Asn Ser Pro Glu Leu Thr Pro Ala Lys
365 370 375

Leu Asp Leu Gly Glu Arg Thr Leu Asn Tyr Ser Glu Asp Ala Ser
380 385 390

Lys Lys Lys Tyr Phe Leu Ser Lys Thr Leu Glu Cys Leu Pro Val
395 400 405

Asn Val Gln Thr Met Ser Tyr Leu Asp Ser Ile Gln Ile Pro Ser
410 415 420

Trp Lys Ile Asp Phe Ala Arg Gly Glu Ile Arg Ile Ser Pro Gln
425 430 435

Ser Thr Pro Ile Ala Arg Ser Leu Leu Lys Leu Asp Leu Ser Lys
440 445 450

Ile Lys Glu Lys Lys Ser Leu Thr Trp Glu Thr Ser Ser Tyr Asp
455 460 465

Leu Glu

(2) INFORMATION FOR SEQ ID NO:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 258 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

Met Ser Lys Val Lys Leu Thr Lys Glu Asn Ile Val Ser Leu Leu
5 10 15

Thr Gln Ser Ala Asp Val Glu Phe Glu Glu Asp Gln Asn Gln Val
20 25 30

Ala Phe Asn Phe Lys Thr Phe Cys Gln Glu Asn Leu Asp Leu Ile
35 40 45

Lys Lys Met Ser Ile Thr Ser Cys Leu Thr Phe Leu Lys Asn Arg
50 55 60

Gln Gly Ile Met Lys Val Val Asn Gln Ser Asp Phe Thr Phe Gly
65 70 75

Lys Val Thr Ile Lys Lys Asn Ser Glu Arg Val Gly Ala Lys Asp
80 85 90

Met Thr Phe Arg Arg Leu Asp Ser Met Ile Arg Val Lys Leu Ile
95 100 105

Glu Glu Thr Ala Asn Asn Glu Asn Leu Ala Ile Ile Lys Ala Lys
110 115 120

Ile Ala Ser His Pro Leu Val Gln Ala Tyr Gly Leu Pro Leu Ala
125 130 135

Asp Ala Lys Ser Val Arg Leu Ala Ile Met Leu Gly Gly Ser Ile
140 145 150

Pro Leu Ile Ala Ser Val Asp Ser Phe Glu Met Ile Ser Val Val
155 160 165

Leu Ala Ile Tyr Gln Asp Ala Lys Tyr Lys Glu Leu Gly Ile Glu
170 175 180

Pro Thr Lys Tyr Asn Thr Lys Glu Ala Leu Gly Lys Val Cys Thr
185 190 195

Val Leu Lys Ser Lys Gly Phe Thr Met Asp Asp Ala Gln Ile Asn
200 205 210

Lys Gly Lys Glu Tyr Ala Lys Ile Leu Ser Ser Cys Asn Pro Asn
215 220 225

Ala Lys Gly Ser Ile Ala Met Asp Tyr Tyr Ser Asp Asn Leu Asp
230 235 240

Lys Phe Tyr Glu Met Phe Gly Val Lys Lys Glu Ala Lys Ile Ala
245 250 255

Gly Val Ala

(2) INFORMATION FOR SEQ ID NO:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3049 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
AGAGCAATTG GGTCATTTTT TATTCTAAAT CGAACCTCAA CTAGCAAATC 50 

3 5 TGAGAACTGT AATAAGCACA AGAGCACAAG AGCCACAATG TCATCAGGTG 100 

TTTATGAATC GATCATTCAG ACAAAGGCTT CAGTTTGGGG ATCGACAGCA 150 

TCTGGTAAGT CCATCGTGGA TTCTTACTGG ATTTATGAGT TTCCAACTGG 200 

TTCTCCACTG GTTCAAACTC AGTTGTACTC TGATTCGAGG AGCAAAAGTA . 250 

GCTTCGGCTA CACTTCAAAA ATTGGTGATA TICCTGCrGT AGAGGAGGAA 300 

4 0 AITTTATCTC AGAACGTTCA TATCCCAGTG TITGATGATA TTGATTTCAG 350 

CATCAATATC AATGATTCTT TCTTGGCAAT TTCTGTTTGT TCCAACACAG 400 
frAACAOCAA TGGAGTGAAG CATCAGGGTC ATCTTAAAGT TCITICTCTT 450 

(maAITGC ATCCCTlTGA ACCTGTGATG AGCAGGTCAG AGATIGCTAG 500 

CAG&rTCCGG CTCCAAGAAG AAGATAIAAT TCCTGATGAC AAATATAIAT 550 

CTGClECTAA GAAGGGAittf CTCTCCTGTG TCAAAGAACA tACTEACAAA 600 

5 GTdGAAAiGA GOCACAAltA GGCTMqGC AAAGTGAATG flCTTTCTCC 650 

TAACAGAAAX GttCA^GAGf mCAAACCA AATTTCAACC 700 

AGAltlGAAAG iAAiAACAGA AClXaEAAATT CTCTTGCAGT CAAAICTTTG 750 

CTGA*GGCTA CAgAAAAGAA CATTATGOCT AACTCTCAAG CnTTCTTAA 800 

AGGffifcACT GATICtCAlT? ttAAGTTGAG CCTTTGGCIG AGAAITCCAA 850 

1 0 AAbWitiAA gcaaa$agcc AiacAgaagc TCTTCAAGIT TOCAGGAGAC 900 

GAaACCGGTA AAAGTTrCTA ttTGTCTATT GCATGCATCC CAAATCACAA 950 

CAGfGfGGM ACAGCTiTAA ATCTCACTGT TA1A1GTAGA CATCAGCTTC 1000 

CAA$GCCTAA GTCCAAAGCT (XSTTTGAAT TATCAATGAT TTTCTCCGAT 1050 

CTGAAAGAGC CTTACAACAC IGTGCATGAT CCTTCATATC CTCAAAGGAT 1100 

1 5 TGTttAlGCT TIGCrtGAGA CTCACACTTC CTTTGCACAA GITCTCTGCA 1150 

ACAAGGIGCA AGAAGAlGTG ATCATATATA CTATAAACAG COCTGAACTA 1200 

AQXCAGCTA AGCTGGATCT AGGTGAAAGA ACCTTGAACT ACAGTGAAGA 1250 

TGCfTCGAAG AAGAAGTATT liCTITCAAA AACACTCGAA TGCTIGCCAG 1300 

TAAMGTGCA GACTATGTCT TATTTGGATA GCATCCAGAT TCCTIGATGG 1350 

2 0 AAGAmGACT .TTGCCAGAGG AGAGATCAGA ATCTOCCCTC AATCTACTCC 1400 

TAT'TGGAAGA TCTTIGCTCA AGCTOGATTT GAGCAAGATC AAGGAAAAGA 1450 

AGTCCftGAC TiGGGAAACA TCCAGCTAlG ATCTAGAATA AAAGTGGCTC 1500 

AIJ^CTCT AAGTAGTATT TGTCAACTTG CTEATCCTTT ATGTTGTTTA 1550 

TrtX^ITAA ATCTAAAGTA AGTTAGAITC AAGTAGTTEA GTATGCTATA 1600 

2 5 GCAi'iMlAC AAAAAATAGA AAAAAATACA AAAAAAIACA AAAAATATAA 1650 

AAAAGGCAAA AAGAtCCCAA AAGGGACGAT TIGGTTGA1T TACTCTGTTT 1700 

TAGGC1*1AIC iMGCfTGCTT ftXSTTTGAGC AAAATAACAT TGTAACAIGC 1750 

AATAAGTGGA AlftAAAGTC CTAAAAGAAG TTTCAAAGGA CAGCTTAGCC 1800 

AAAATftSGTT TriGTiTTTG iTTITTrGTT TITTGTrnT TTGTTTTATT 1850 

3 0 TTTAilTTTA G1TIA1TTTT TGTTITIGTT ATTTTrATTT TTATTTTAJT 1900 

Tiul'i'rmlT T1MT1ATAT ATATATCAAA CACAATCCAC ACAAATAA1T 1950 

TT&ATTrcAA AGAlTCmCT GA1TEAACAC ACTTAGCCTG ACTTTATCAC 2000 

ACTEAACACG CttAGTtAGG CTTEAACACA CIGAACTGAA TTAAAACACA 2050 

CTEAGfAiTA TGCATCTCiT AATTAACACA CTTTAATAAT AlGCATCTCT 2100 

3 5 GMTCAGCCT TAAAGAAGCT TTTATGCAAC ACCAGCAATC TIGGOCTCTT 2150 
TCTTAACTCC AAACATTTCA TAGAATTTGT CAAGATTATC ACTGTAATAG 2200 
TCCATAGCAA TGCTt^CCTT AGCATTGGGA TTGCAAGAAC TAAGTATCTT 2250 

ggcAtattct iMiM tttatckttgc atcatccatt gtaaatcctt 2300 
tMMtaag cact^tgcaA Accttccoca gagcttcctt agtcttgtac 2350 

TTAtfiTGGTT O^TOCCMa CTO^TGmC TTTOCATCTT GaTATATGGC 2400 

AAGAACAACA ClGAlCMci 1 CXjAAGCTGTC AACAGAAGCA ATGAGAGGGA 2450 

TACMCCTCC AAGCAttMA GCAAGTCTCA CAGATITTGC ATCTGCCAGA 2500 

C4GGAGCCCGT AAGCTTGGAC CAAAGGGTGG GAGGCAATTT TTGCTTTGAT 2550 

AAI^GGAaGA TfiCTCATTGT TTGCAGTCTC TTCTATGAGC TTCACTCTTA 2600 

TCATGGTATC AAGCCroCTG AAAGTCATAT CCTIAGCTCC 'aACTCTITCA 2650 

GAATITTTCT TTATCtSTGAC CTTAcCAAAA GTAAAATCAC TTTGGTTCAC 2700 

AACTTTXATA ATCCCTTjGGC GATTCTTCAA GAAAGTCAAA CATGAAGTGA 2750 

TAGTCAlTTT CiTAATCAGG TCAAGATTTT CCTGACAGAA AGTCTTAAAG 2800 

TIGAATGCGA CCTGGTTCTG GTCTICTTCA AACTCAACAT CTGCAGATTG 2850 

AGTTAAAAGA GAGACAATGT TTTCTITTGT GAGCTTGACC TTAGACATGG 2900 

TGGCAGTTtA GATCTAGACC ITTCTCGAGA GATAAGATTC AAGGTGAGAA 2950 

AGTGGAACAC TGTAGACCGC GGTCGTCACT TATCCTGTTA ATGTGATGAT 3000 

TTGTATTCCT GAGTATTAGG TTTTTGAATA AAATTGACAC AATTGCTCT 3049 
(2) INFORMATION FOR SEQ ID NO:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 778 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

ATG CAA CAC CAG CAA TCT TGG CCT CTT TCT TAA CTC CAA 39 

ACA TiT CAT AGA ATT TGT CAA GAT TAT CAC TGT AAT AGT 78 

CCA TAG CAA TGC TTC CCT TAG CAT TGG GAT TGC AAG AAC 117 

TAA GTA TCT TGG CAT ATT CTT TCC CTT TGT TEA TCT GTG 156 

CAT CAT CCA TTG TAA ATC CTT TGC TTT TAA GCA CTG TGC 195 

AAA CCT TCC CCA GAG CTT CCT TAG TGT TGT ACT TAG TTG 234 

GTT CAA TCC GTA ACT CCT TGT ACT TTG CAT CTT GAT ATA 273 

TGG GAA GAA CAA CAC TGA TCA TCT CGA AGC TGT CAA CAG 312 

AAG CAA TGA GAG GGA TAG TAC CTC CAA GCA TTA TAG CAA 351 

GTC TCA CAG ATT TTG CAT CTG CCA GAG GCA GCC CGT AAG 390 
CTT GGA OCA AAG GGT GQG AGG CAA TTT TTG CTT TGA TAA 429 
TAG CAA GAT TCT CAT TGT TTG CAG TCT CTT CTA TGA GCT 468 
TCA CTC TTA TCA TGC TAT CAA GCC TCC TGA AAG TCA TAT 507 
CCI? TAG CTC CAA CtC TTT 1 CAG AAT TTT TCT TTA TCG TGA 546 
5 CCT TAfi CAA AAG TAA AAT" CAC TIT GGT TCA CAA CTT TCA 585 
TAA TGC CTT GGC GAT TCT TCA AGA AAG TCA AAC ATG AAG 624 
TGA TAG TCA TTT TCT TAA TCA GGT CAA GAT TTT OCT GAC 663 
AGA AAG TCT TAA AGff TGA ATG CGA OCT GGT TCT GGT CTT 702 
CTT CAA AC? CAA CAT CTG CAG ATT GAG TTA AAA GAG AGA 741 

1 0 CAA TGT TTT CTT TTG TGA. GCT TGA OCT TAG ACA TGG 778 

(2) INFORMATION FOR SEQ ID NO:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:
GTTCTGAGAT TTGCTAGT 18 

(2) INFORMATION FOR SEQ ID NO:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
TTATATCTTC TTCTTGGA 18 
(2) INFORMATION FOR SEQ ID NO:18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1401 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:
ATG TCA TCA GGT GTT TAT GAA TCG ATC ATT CAG ACA AAG 39 
GOT TCA GTT TGG GGA TCG ACA GCA TCT GGT AAG TCC ATC 78 
GTG GAT TCT TAC TGG ATT TAT GAG TTT CCA ACT GGT TCT 117 
CCA CTG GTT CAA ACT GAG TTG TAG TCT GAT TCG AGG AGC 156 
AAA ACT AGC TTC GGC TAC ACT TCA AAA ATT GGT GAT ATT 195 
CCT GCT GTA GAG GAG GAA ATT TEA TCT CAG AAC GTT CAT 234 
ATC CCA GTG TTT GAT GAT ATT GAT TTC AGC ATC AAT ATC 273 
AAT GAT TCT TTC TTG} GCA ATT TCT GTT TGT TCC AAC ACA 312 
GTT AAC ACC AAT GGA GTG AAG CAT CAG GGT CAT CTT AAA 351 
GTT CTT TCT CTT GCC CAA TTG CAT CCC TTT GAA CCT GTG 390 
ATG AGC AGG TCA GAG Al*f GCT AGC AGA TTC CGG CTC CAA 429 
GAA GAA GAT ATA ATT GGT GAT GAC AAA TAT ATA TGT GCT 468 
GCT AAC AAG GGA TCT GTG TCC TGT GTC AAA GAA CAT ACT 507 
TAC AAA GTC GAA ATG AGC CAC AAT CAG GCT TTA GGC AAA 546 
GTG AAT GTT CTT TCT CCT AAC AGA AAT GTT CAT GAG TGG 585 
CTG TAT ACT TTC AAA CCA AAT TTC AAC CAG ATC GAA ACT 624 
AATAACAGAACTGTAAaTTCTCTTGCAGTCAAAT^ 663 
CTC ATG GCT ACA GAA AAC AAC ATT ATG CCT AAC TCT CAA 702 
GCT TTT GTT AAA GCT TCT ACT GAT TCT CAT TTT AAG TTG 741 
AGC CTT TGG CTG AGA ATT CCA AAA GTT TTG AAG CAA ATA 780 
GCC ATA CAG AAG CTC TTC AAG TTT GCA GGA GAC GAA ACC 819 
GCT AAA ACT TTC TAT TTG TCT AIT GCA TGC ATC CCA AAT 858 
CAC AAC ACT GTG GAA ACA GCT TTA AAT GTC ACT GTT ATA 897 
TCT AGA CAT CAG CTT CCA ATC CCT AAG TCC AAA GCT CCT 936 
TTT GAA TTA TCA ATG ATT TTC TCC GAT CTG AAA GAG CCT 975 
TAC AAC ACT GTG CAT GAT CCT TCA TAT CCT CAA AGG ATT 1014 
GTTCATGCTTTGCTTGAGACTCA^ 1053 
GTT CTC TGC AAC AAG CTG CAA GAA GAT GTG ATC ATA TAT 1092 
ACT ATA AAC AGC CCT GAA CTA ACC CCA GCT AAG CTG GAT 1131 
CTA GGT GAA AGA ACC TTG AAC TAC ACT GAA GAT GCT TCG 1170 
AAGAAGAAGTATTTTCTTTCAAAAA^ i209 
CCA GTA AAT GTG CAG ACT ATG TCT TAT TTG GAT AGC ATC 1248 
CAG ATT CCT TCA TGG AAG ATA GAC TTT GCC AGA GGA GAG 1287 
ATC AGA ATC TCC CCT CAA TCT ACT CCT ATT GCA AGA TCT 1326 
TTG CTC AAG CTG GAT TTG AGC AAG ATC AAG GAA AAG AAG 1365 
TCC TTG ACT TGG GAA ACA TCC AGC TAT GAT CTA GAA 1401 
(2) INFORMATION FOR SEQ ID NO:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 777 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:



Arc TCP AAG GTC AAd m ACA AAA GAA AAC ATT GTC TCT CTT TEA 45
ACT CAA TCT GCA GAT GTT GAG TTT GAA GAA GAC CAG AAC CAG GTC 90
GCA TTC AAC TTT AAG ACT TTC TGT CAG GAA AAT CTT GAC CTG ATT 135
AAG AAA ATG ACT m ACT TCA TGT TTG ACT TTC TTG AAG AAT CGC 180
CAA GGC ATT ATG AAA GTT GTG AAC CAA AGT GAT TTT ACT TTT GGT 225
AAG GTC ACG ATA AAG AAA AAT TCT GAA AGA GTT GGA GCT AAG GAT 270
ATG ACT TTC AGG AGG CTT GAT AGC ATG ATA AGA GTG AAG CTC ATA 315
GAA GAG ACT GCA AAG AAT GAG AAT CTT GCT AIT ATC AAA GCA AAA 360
Arr dec TCC CAC OCT TTG GTC CAA GCT TAC GGG CTG CCT CTG GCA 405
GAT GCA AAA TCT GTG AGA CTT GCT ATA ATG CTT GGA GGT AGT ATC 450
ccr CTC ATT GGT TCT GTT GAC AGC TTC GAG ATG ATC AGT GTT GTT 495
CTT GCC ATA TAT CAA GAT GCA AAG TAC AAG GAG TEA GGG ATT GAA 540
CCA ACT AAG TAC AAC ACT AAG GAA GCT CTG GGG AAG GTT TGC ACA 585
GTG CTT AAA AGC AAA GGA TTT ACA ATG GAT GAT GCA CAG ATA AAC 630
AAA GOG AAA GAA TAT GCC AAG ATA CTT ACT TCT TGC AAT GCC AAT 675
GOT AAG GGA AGC ATT GCT ATG GAC TAT TAC ACT GAT AAT CTT GAC 720
AAA TTC TAT GAA ATG TTT GGA GTT AAG AAA GAG GCC AAG ATT GCT 765
GGT GTT GCA TAA 777
(2) INFORMATION FOR SEQ ID NO:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
TACTTATCTA GAACCATGGA CAAAGCAAAG ATTACCAAGG 40 
(2) INFORMATION FOR SEQ ID NO:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 42 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

TtoGTOG&i? CGA^GGITAT TTCAAAIAAT TTAIAAAAGC AC 42 
(2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:
AGEA1TQGAT CCATGGITAA CACACTAAGC AAGCAC 36

(2) INFORMATION FOR SEQ ID NO:23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 46 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
AGCTAATCTA GAACCATGGA TGACTCACTA AGGAAAGCAT TGTTGC 46

(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
CCC^CTATCC TICQCAAGAC CC 22 
(2) INFORMATION FOR SEQ ID NO:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

TACACTGGAT CCATGGTTAA GGIAATCCAT AGGCTTGAC 39 
(2) INFORMATION FOR SEQ ID NO:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:

AqctaAdcat Ggteaagctc ACTAAGGAAA GCATTGTTSC 40 
(2) INFORMATION FOR SEQ ID NO:27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 46 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
AGCTAATCm GAACCATQGA TGACTCACTA AGGAAAGCAT TGTTGC 46
(2) INFORMATION FOR SEQ ID NO:28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:
AGCATTGGAT OCATGGTIAA CACACTAAGC AAGCAC 36

(2) INFORMATION FOR SEQ ID NO:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:

TACAGTTCTA GAACCATGGA TGATGCAAAG TCTGTGAGG 39 
(2) INFORMATION FOR SEQ ID NO:30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 base pairs
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:

AGASK?i?CTA dACCATCCTG ACTTGATCAG O^GTCTGT GA.GGCTTGC 49 
Thus, while we have illustrated and described the preferred
embodiments of our invention, it is to be understood that this invention
is capable of variation and modification, and we therefore do not wish
to be limited to the precise terms set forth, but desire to avail
ourselves of such changes and alterations which may be made for
adapting the invention to various usages and conditions. Such
variations and modifications, for example, would include the
substitution of structurally similar nucleic acid sequences in which the
difference between the sequence shown and the variation sequence is
such that little if any advantages are available with the variation
sequence, i.e. that the sequences produce substantially similar results
as described above. Thus, changes in sequence by the substitution,
deletion, insertion or addition of nucleotides (in the nucleotide
sequences) or amino acids (in the peptide sequences) which do not
substantially alter the function of those sequences specifically
described above are deemed to be within the scope of the present
invention. In addition, it is our intention that the present invention may
be modified to join the N genes of various isolates that provide
resistance or immunity to tospovirus infection of plants according to
the present invention into a single cassette, and to use this cassette as
a transgene in order to provide broad resistance to the Tospoviruses,
especially to TSWV-BL, TSWV-B, and INSV. Accordingly, such changes
and alterations are properly intended to be within the full range of
equivalents, and therefore within the purview of the following claims.

Having thus described our invention and the manner and a process
of making and using it in such full, clear, concise and exact terms so as
to enable any person skilled in the art to which it pertains, or with
which it is most nearly connected, to make and use the same;

We Claim:

1. An isolated nucleotide sequence which is selected from the
group

AAATitTCTT GCflGT GAAA ff CTCTGCTCAT GTEAGCAGAA AACAACATCA 50 
TGCCTAACTC TCAAGCifTTi' GTCAAAGCTT raCTGATTC TAATTTCAAG 100 
CTGAQd!^ GGCTAAGGGt T^CMAGGTT TTGAAGCAgA TTTCCATTCA 150 

gAaaStgtTC MotfMAd &gMgaaac AaataaaacA ttttatitat 200 

Clftggp Gft3 GATTCGAAACl cMaACAGTG TTGAGAcAGC TTTAAACArT 250 

GmAGGATCA G^OGCAffi CGEAAATGTA AAACTCCTTT 300 

TGAATT&flcA A^GAfflTO r^TCATrrAAA GGAGCCTTAC MCATTATTC 350 

AlM^^jyd ATATCCCCAA AGGATTGTTC ATGCTCTGCT TGAAACTCAC 400 

S^^ C ^^AAGAAG ATCTGATCAT 450 

ffiffirg^ ^^^g tacTAAcrcc tqgaaagttA gatitaggtg soo 

AAATAACITT| GAATTACAA1 4 GAAGACGCCT ACAAAAGGAA ATATTTCCTT 550 

TCAAAAACAC TTGAATGTCT' TCCATCTAAC ATACAAACTA TGTCTTATTT 600 

AGAGAGGATd (^AATCCCTT CCTGGAAGAT AGACTTTGCC AGGGGAGAAA 650 

Tl^Mlte: TCCACAA!fCtf ATTTCAGiTG CAAAATCTTT GTTAAATCTT 700 

G^mMGOj GGMtAAAAA GAAAGAATCT AAGATTAAGG AAGCATATGC 750 

™ftff3Afc k AAATGATCTT' GCTGTGTCCA GCTTITTCTA ATTAtGTTAT 800 

GITl^TTC TflCTmCT TATAATTATT TITCTGTTTG TCA1TTCTTT 850 

CAAAMCCtG CTGimGTA GAAACCATAA AAACAAAAAT AAAAATAAAA 900 

TA^AiteAAA ATAAAATAAA AATCAAAAAA TGAAATAAAA GCAACAAAAA 950 

^^^^^ c y AA ^j xA - Aaaaagatcc CGAAAGGACA ATTTTGGCCA 1000 

WlWito TTT bTl'lTlT GTTTTTATTT 1050 
ffi 3 ™ ?^ ™™ MtITATITI ' ATTTTATGTT 1TIGTIGTIT 1100 
m-rlM'llT GTTATTTATT AAGCACAACA CACAGAAAGCA AACTTTAAT 1150 
TAAACACACT TMTTAAAAT iTAACACACT AAGCAAGCACA AACAATAAA 1200 
™^ GCTITATATA TTTATAGGCT TITTTATAAT TTAACITACA 1250 
GCTfcm*TrA AGC^GTTCT GTGAGTTTTG CCTGTTTTTT AACCCCAAAC 1300 
AOTGMAGA ACTTGTTAAG GGTTTCACTG TAATGTTCCA TAQCAATACT 1350 

tcctttaqca ttAggattgc tggagctaag tatagcagca tactctttcc 1400 
ccttcttcAc! ctGAtcrtcA ttcatttcaa atgcttttct titcagcaca 1450 

GTGCAAACTT' TTCCtMGQC TTCCCTQGTG TCATACTTCT TTGGGTCGAT 1500 

COCGA^CC TTCTAWtG CATCCTGATA TATAGCCAAG ACAACACTGA 1550 

TCMCTGAAA GCTATCAACf (3AAGCAATAA GAGGTAAGCT ACCTCOCAGC 1600 

AlTATGGCAA GCCTCAcAgA cWtfGCAtCA -TCAAGAGGTA ATCCATAGGC 1650 

TIGAATCAAA GGGTGGGAAG CMrcTTAGA TITGATAGTA TTGAGATTCT 1700 
CAGAATTCC 1709; 

TTM(AO\CT AAGCAA GCAC AAACAATAAA GAIAAAGAAA GCTTTATATA 50 

TITAlaGGCr ITT ITATAAT TTAACTrACA GCTCCTTTEA AGCAAG1TCT 100 

GTGAGfrrTTG OCauiWM' MCCCCAAAC A1TTCAIAGA ACTTCTTAAG ISO 

GbiYIUACTG TAATGlTCCA TAGCAATACT TCCTTTAGCA TTAGGATTGC 200 

TGGAGCTAAG TAT AGCAG CA TACItTrTCC COTCTTCAC CTGATCTTCA 250 

TICATTTCAA ATGCTTTTCT TTTCAGCACA GTGCAAACTT TICCTAAGGC 300 

TTCCCTGGTG TCATACTTCT TTGGGTCGAT CCCGAGATCC TTGTATTTTG 350 
CMtXTGATA TAmGCCAAG ACAACACTGA TCATCTCAAA GCmTCAACT 400 

GAftfiCAA lAA GAGGTAAGCT ACCidXAGC ATTAIGGCAA GCCTCACAGA 450 

C MOJAI CA. Ift^G ASGTA MCCAIAGGC TTGACTCAAA GGGTGGGAAG 500 

CMLCTiAGA TTltSAIljASiA TftaAGAlTtT CAGAArrCCC AGTITCCTCA 550 

ACAAG<3CTt3A CCerGAltAA QcMStAAGC CTICTX3AAGG TCATGTCAGT 600 

GGClt£AATC CTGlt,l\3AAG ffittBff GGTAATIT1A (XAAAAGTAA 650 

AAffiGSfTflG CttMltMOC Tlt!A!nA3GC TCTGACGATT CTTCAGGAAT 700 

GTCAGACATG AaA1AA#GCT CAlUiTiTTG ATCTQGTCAA GGTITICCAG 750 

ACAAAA AGIt: TTGAA GyiG& ATTSdlACCAG A1TCTGATCT TCCTCAAACT 800 

CAAMUlTT GCC'l'M^iU McAaAGCAA CAATQCnTC CTIAGTGAGC 850 
TllftAcCA!r 858; 

AAAllOlLTl GCAGT GAAAT CICTQCTCAT GTTAGCAGAA AACAACATCA 50 

TOCcftAAcrrC t\Aago1tit gtcaaagctt ctactgattc TAATTTCAAG 100 

CTQA^CTCf GGCTAAGGGT' TOCAAAGGIT TTGAAGCAGA TITCCATTCA 150 

GaAAilViiiti AAGGTI^GCAG GAGATGAAAC AAATAAAACA TTITATTTAT 200 

UiM^IuCCTG GAITOGAaAC CATAAcAGTG TTGAGACAGC TTTAAACATT 250 

Aul^riAiTl 1 GCMGCAgcA GCiOXAATT CGTAAATGTA AAACTCCTIT 300 

TGAA£fATCA AltBAitelTiT CTGATTTAAA GGAGCCTTAC AACATEATTC 350 

AIGA^Cttt AtAlCCCCAA AGGATTGTTC AlGCTCTGCT iGAAACTCAC 400 

ACATWiTiG CACAAGTTCT' ItGCAACAAC TTGCAAGAAG ATGTGATCAT 450 

ClAGAGCTte AACAACCATG AGCTAACTCC TGGAAAGTTA GAITTAGGTG 500 

AAAtMCTIT GAAITTACAAT GAAGACGCCT ACAAAAGGAA ATA3TICCTT 550 

TCAAAAAcAG TTGAAiflGTCT TCCATCEAAC AIACAAACTA TGTCTTATTT 600 

AGACA6CATC CAAATCCCTT CCTGGAAGAT AGACnTQCC AGGGGAGAAA 650 

rtAAAATITC TCCACAATCT AiTTCAGTTG CAAAATCTTT GTTAAATCIT 700 

GAlTlAAGCG> GGATIAAAAA GAAAGAATCT AAGATTAAGG AAGCATATGC 750 

TIGA&ATC A AAATCAteTT GCflGTCIOCA GCITTTTCTA ATEATCTTAT 800 

ijd'iAiTriC I'i'lUi'i'lACT IataAtiatt TrrCTGTITG TCA lTlUm ' 850 

CAAA^POdTC uitfluiAGTA GAAACCATAA AAACAAAAAT AAAAATAAAA 900 

TAAAAitAAA AIAAAAtAAA AAIcAAAAaA TGAAATAAAA GCAACAAAAA 950 

A AllAAAA AA CAAAAAACC A AAAAAGATCC CGAAAGGACA ATTTTGGOCA 1000 

AAlTieGGGf I'i^'l'M'iGt TiTiTbTlTT TlTbTJLTJ.Tr GTITTTA TTT 1050 

itAiTirrAT TTnArriTr atitiatttt ATrrrATGrr mui'MTr 1100 

TIGTlATrTr GTTAlTrAlT AAGCACAACA CACAGAAAGC AMCTTTAAT 1150 

TAAAGACACT TArrtAAAAT TTAACACACT. AAGCAAGCAC AAACAATAAA 1200 

GATAAAGAAA GCTITATATA TTIATAGGCT TITTIATAA.T TTAACTTACA 1250 

GCTGC'fiTm AGCAAGTTCT GTCAGTTTTG CCT GTTmT AACCOCAAAC 1300 

A TTTGM AgA ACITGTEAAG GGTITCACPG TAATGITOCA TAGCAATACT 1350 

TX£TT2AGCA TIAGGATTGC 'TGGAGCTAAG TATAGCAGCA lACTCTTTCC 1400 

CCTTCfrCAC CiGAiCTTCA iTCATTTCAA A1GCTITTCT TTTCAGCACA 1450 



GTGGAAACTT TTCCTAAGGC TCAlACriCT TTCGGTCGAT 1500 

COCGAGATCC TTGTA1TITG CATCCTGATA TATAGCCAAG ACAACACTGA 1550 
TCAitJTCAAA GCTATCAACT GAAGCAATAA GAGGTAAGCT ACCTCCCAGC 1600 
ATTAiGGCAA GCCTCACAGA CTITGCATCA TCAAGAGGTA ATCCATAGGC 1650 
™ACrCAAA GGGTGGGAAG CAATCITAGA TITGATAGTA TTGAGATTCT 1700 

CAGftfcTOXC AGTrttiCTCA ACAAGCCTGA COCTGATCAA GOATCAAGC 1750 

L ' i ' M t ^ G ^^^^ ^QCTOCAATC CIGTCTGAAG TmmlK T 1800 

G&AAtffflA CCAAAAgtAA AA103CTITG OTAAiAACC TKA1TATCJC 1850 

T^DGMt CfiffgGM f GTtAGACATG AAATAATQCT CATClTmG 1900 

AJ^^ltAA GGll^fOGAG ACAAAAAGTC TTGAAGITGA ATGCTACCAG 1950 

frrffifoffi '^fe'^AAa^ tAAGGTCTIT GOCTTCTGrC AACAAAGCAA 2000 
CAATGCITTC CTIA GTCAGC ItAAcCAT 2028; and 

AGAGGAAlftG GGTGAlfM? ^ATTCTAAAT CGAAOCTCAA CIAGCAAATC 50 

^SSH AGAGCACAAG AGCCACAATG TCATCAGGTC 100 

WM^AIC GAttAa-fcAG AcAAAGGCTT CAGTTIGGGG ATCGACAGCA 150 

T£EGgCAAGT GCAltjG^GGA frtfftACTGG ATTEATGAGT TTCCAACIGG 200 

TTCttfcACtfG GT^tCAAACffG AGftDGTACTC TGATTCGAGG AGCAAAAGTA 250 

GOTOOGCfEA CACl&AAAA AftGGTGATA TTCCTCCTGT AGAGGAGGAA 300 

AMmiCTC AGAAOdtttA lATCCCAGTG T1TGATGATA TIGAITTCAG 350 

CATGAAiATC AA3X3AfttlT ^K^ITGGCAAT TICTGTTTGT TOCAACACAG 400 

TTAACSAOCAA IflGGACrfGAAG CATCAGGGTC ATCTTAAAGT TCTITCTCTT 450 

GOCCAAlTGC AltxmTGA AOCTGTGATG AGCAGGTCAG AGAITCCEAG 500 

CAGAftOCGG CTCCAAGAAG AAGATATAAT T0CTGAIGAC AAAIAIATAT 550 

CTGOK3CTAA G^CGGA'TCl' CTCTOCTGTG TCAAAGAACA TACTTACAAA 600 

GTCGAAATGA GOCACAATCA GGCTTEAGGC AAAGTGAATC TTCTTTCTCC 650 

TAAGAGAAAfc GlrCATGAGf GGCTGTATAG TITCAAAOCA AAlTTCAACC 700 

AGATOGAaAG IMCAACAgA ACTGTAAATT CTCTIGCAGT O^AATCTITG 750 

CT^GGCTA GAGAAAACAA CATEATGOCT AACTCTCAAG C1TITCTTAA 800 

G S^ dl? G*^^ 1 * 'frAAGTIGAG CCITliGGCIG AGAATTCCAA 850 

AAGITOGAA GCAAAIAgCC AtACAGAAGC TCTTCAAGTT TGCAGGAGAC 900 

GAAAOCGGTA AAAgMctA iTTGTCTATT GCATGCATCC CAAATCACAA 950 

CAGttftGGAA ACAGCjftTAA AlGTCACTGT TATATGTAGA CATCAGCTTC 1000 

CAATOCCTAA GttCAAAGCT CCITITGAAT TATCAATGAT TITCTCCGAT 1050 

CTGAAAGAGC CfttACAACAC !i?GlGCATGAT (XTKATATC CTCAAAGGAT 1100 

TGntfttaC* ffiGCflDGAGA CTCACACITC CTTTGCACAA G1TCTCTX3CA 1150 

AO*AGC*GCA AGAAgAIGTG ASCA3ATATA CTAIAAACAG (XCTGAACTA 1200 

AOCCCAGCTA AGCtGGAlC¥ AQGTGAAAGA AOCITGAACT ACAGTGAAGA 1250 

TGCT^^AG AAGAAtftAlT iTCftTCAAA AACACTCGAA TGCTIGCCAG 1300 

TAAAltJiGCA GACtAftftCT* !iAiTTQGATA GCATOCAGAT TTCTTCAIGG 1350 

AAGAEAGACI' WgCCAGAGG AGAGATCAGA ATCTOCCCTC AATCTACTCC 1400 

TArrSGAAGA IdlTtGCrcA AGCTGGATIT GAGCAAGATC AAGGAAAAGA 1450 

A 2H?^ (: WigggaAa c a *o^ggtatg ATCTAGAATA AaAgTGGCTC 1500 

AAGtAG^AW ^GTCAAClTG CTTATCCTTT A l G Tl U l ' lTA 1550 

rlilriVlTAA AlCtAAAGTA AGT'EAGAITC AAGIAGTiTA GTATGCTATA 1600 

GCATtAlTAC AAAAAAIACA AAAAAAIACA AAAAAAIACA AAAAATA'IAA 1650 

AAAAOCCAAA AAGATdCCAA AAGGGACGAT TTGGTTGA2T TACTCTGTTT 1700 

TAGGCTIATC TAAGCTGCTT iTCITTGAGC AAAATAACAT TCTAACAIGC 1750' 

AAIAACTGGA AlTEAAAGTC CTAAAAGMG nTCAAAGGA CAGCTTAGCC 1800 



AAA Al'iGGTf ImTlMT TiTM'mT TT bTlTTA TT 1850 

TlTAfflTJA G ^M'P i'JT fTSMTTGIT M'lTlTATTT TTATTITA1T 1900 

TiLWriMT (ri relfflij ffP AiEAlATCAAA CACAAICCAC ACAAATAATT 1950 

TTAATIlCAA ACA!ftt?tACf Gatteaacac acteagcctg Actttatcac 2000 

AdfiAACACG C?HA(Maq3 t?Tt*EAACACA CTGAACTGAA tlAAAACACA 2050 

CliAG'iAl'iA IfGCAlt'lt'i'i 1 AAiTAACACA CTITAATAAT AlGCATCTCT 2100 

GAA^AGCff $AAA£Aft3G¥ ffiAiGCAAC ACCAGCAATC TTGGCCTCIT 2150 

TC?f^AACrcC AAACAfflGA ^AGAAftTGT CAAGA1TATC ACTG1AATAG 2200 

tocMagcaa tGcffigoffl agcattggga tigcaagaac taagtatcit 2250 

GC^SAfK* 'HtGCM^ ffiftltnGTCC ATCATCCATi GiAAATCCTT 2300 
TOffi'iMG CACft^GCAA ACXniCCCCA GAGCTTCCIT AGTGTTGTAC 2350 
flAiMGGft GAAi^jfit^EAA CTtCiTGTAC T1TGCATCTT GATATATGGC 2400 
AAGAACAACA CtGAiGAltft CGAAQCTGTC AACAGAAQCA ATGAGAGQGA 2450 
TAGijAOGTOC AAGCAfEAtA GCAAGTCTCA CAGATTTIGC ATCTGCCAGA 2500 
GGCAGCCCGT AAGClffGGAC CAAAQGGTGG GAGGCAATTT TTCCTTTGAT 2550 
AAHAGCAAGA . TTtrarrGt ftGCAGTCTC TTCTATGAGC TICACTCTTA 2600 
TC AlgCTAl t AAGCClXXTG AAAGTCATAT CCTTAGCTCC AACTCTTTCA 2650 
GAAgHTlCfr ITATOGlGAC CttAOCAAAA GTAAAATCAC TITGGITCAC 2700 
AAGfffi tAffA AlGOMfeGC GAttCrrCAA GAAAGTCAAA CATGAAGTGA 2750 
TA^^CAT^ dfaAAgAGG ftAAGATTIT CCTGACAGAA AGTCTEAAAG 2800 
TIGAAiGCGA GCTX9CM*tX3 GTCTICTICA AACTCAACAT CTGCAGAITG 2850 
AGBA AAAGA GAGAGAATGT 'I'l'lLTlTlGT GAGCTTGACC TIAGACATGG 2900 
TGGCAGTWA GATCftAGACd TTTCTCGAGA GATAAGATTC AAGGTGAGAA 2950 
A^GGAAGAG IGIAGACCGC GGTO3TTACT TATCCTGTEA ATGTGATGAT 3000 
TilMfliidT GAGTAttAGG iTTTTGAAIA AAATTGACAC AATTGCTCT 3049 
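
The listings in claim 1 are printed in blocks of ten bases, with the running base count closing each row. The following is a minimal sketch of how one row could be checked against that layout. The function name `check_row` and its `previous_end` parameter are illustrative assumptions, not part of the specification, and the check treats any character other than A, C, G or T as transcription damage.

```python
import re


def check_row(row: str, previous_end: int) -> bool:
    """Check one listing row, e.g. 'ACGT... ACGT... 350'.

    previous_end is the running count printed on the preceding row
    (0 for the first row of a sequence). Hypothetical helper.
    """
    tokens = row.split()
    # The last token is the running count; the final row may end in ';'.
    stated_end = int(tokens[-1].rstrip(";"))
    bases = "".join(tokens[:-1]).upper()
    # Only the four DNA letters are expected in these listings.
    only_acgt = re.fullmatch(r"[ACGT]+", bases) is not None
    # The row must contribute exactly the bases the counter claims.
    length_ok = len(bases) == stated_end - previous_end
    return only_acgt and length_ok
```

A row passes only when it contains nothing but A, C, G and T and its length agrees with the difference between its own count and the preceding one, so a short final row such as one ending at 1709 after a row ending at 1700 is accepted when it carries nine bases.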

2. A plant susceptible to infection by Tospoviruses which has a 
transgene inserted into its genome to render it resistant to infection by 
Tospoviruses, said transgene being selected from the group consisting 
of the nucleoprotein gene of TSWV-BL, TSWV-10W, INSV-LI, TSWV-B, a 
Tospovirus; said transgene consisting of partial or full length 
nucleoprotein gene sequences from TSWV-BL, TSWV-10W, TSWV-B, 
INSV-Beg and INSV-LI, the translatable or untranslatable sequences of 
said nucleoprotein gene sequences, and the sense or antisense 
sequences of said nucleoprotein gene sequences. 
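
Claim 2 covers both sense and antisense forms of the nucleoprotein gene sequences. As an illustration only, and assuming that "antisense" is read in the usual way as the reverse complement of the DNA strand as printed, the relationship can be sketched as follows. The function name `antisense` is a hypothetical helper, not a term defined in the specification.

```python
# Translation table for Watson-Crick pairing, preserving letter case.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense(sense: str) -> str:
    """Return the reverse complement of a DNA string (assumed reading
    of 'antisense'); characters other than A, C, G, T pass through."""
    return sense.translate(_COMPLEMENT)[::-1]
```

Applying the function twice returns the original string, which reflects that the sense and antisense forms carry the same sequence information on opposite strands.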

3. A method for providing a host plant with resistance to 
infection by tospoviruses which comprises inserting a transgene into 
the host plant which gene is selected from the nucleoprotein gene of 
TSWV-BL, TSWV-10W, INSV-Beg, INSV-LI, TSWV-B, or mixtures of 
nucleotide sequences taken from the nucleoprotein gene. 